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Preface to the second edition 


In the ten years that have passed since the publication of the first edition of this text¬ 
book exciting advances have taken place in every discipline of geophysics. 
Computer-based improvements in technology have led the way, allowing more 
sophistication in the acquisition and processing of geophysical data. Advances in 
mass spectrometry have made it possible to analyze minute samples of matter in 
exquisite detail and have contributed to an improved understanding of the origin of 
our planet and the evolution of the solar system. Space research has led to better 
knowledge of the other planets in the solar system, and has revealed distant objects 
in orbit around the Sun. As a result, the definition of a planet has been changed. 
Satellite-based technology has provided more refined measurement of the gravity 
and magnetic fields of the Earth, and has enabled direct observation from space of 
minute surface changes related to volcanic and tectonic events. The structure, com¬ 
position and dynamic behavior of the deep interior of the Earth have become better 
understood owing to refinements in seismic tomography. Fast computers and 
sophisticated algorithms have allowed scientists to construct plausible models of 
slow geodynamic behavior in the Earth’s mantle and core, and to elucidate the 
processes giving rise to the Earth’s magnetic field. The application of advanced 
computer analysis in high-resolution seismic reflection and ground-penetrating 
radar investigations has made it possible to describe subtle features of environmen¬ 
tal interest in near-surface structures. Rock magnetic techniques applied to sedi¬ 
ments have helped us to understand slow natural processes as well as more rapid 
anthropological changes that affect our environment, and to evaluate climates in the 
distant geological past. Climatic history in the more recent past can now be deduced 
from the analysis of temperature in boreholes. 

Although the many advances in geophysical research depend strongly on the aid 
of computer science, the fundamental principles of geophysical methods remain the 
same; they constitute the foundation on which progress is based. In revising this 
textbook, I have heeded the advice of teachers who have used it and who recom¬ 
mended that I change as little as possible and only as much as necessary (to para¬ 
phrase medical advice on the use of medication). The reviews of the first edition, the 
feedback from numerous students and teachers, and the advice of friends and col¬ 
leagues helped me greatly in deciding what to do. 

The structure of the book has been changed slightly compared to the first 
edition. The final chapter on geodynamics has been removed and its contents inte¬ 
grated into the earlier chapters, where they fit better. Text-boxes have been intro¬ 
duced to handle material that merited further explanation, or more extensive 
treatment than seemed appropriate for the body of the text. Two appendices have 
been added to handle more adequately the three-dimensional wave equation and the 
cooling of a half-space, respectively. At the end of each chapter is a list of review 
questions that should help students to evaluate their knowledge of what they have 
read. Each chapter is also accompanied by a set of exercises. They are intended to 
provide practice in handling some of the numerical aspects of the topics discussed 



in the chapter. They should help the student to become more familiar with geophys¬ 
ical techniques and to develop a better understanding of the fundamental princi¬ 
ples. 

The first edition was mostly free of errata, in large measure because of the 
patient, accurate and meticulous proofreading by my wife Marcia, whom I sincerely 
thank. Some mistakes still occurred, mostly in the more than 350 equations, and 
were spotted and communicated to me by colleagues and students in time to be cor¬ 
rected in the second printing of the first edition. Regarding the students, this did not 
improve (or harm) their grades, but I was impressed and pleased that they were 
reading the book so carefully. Among the colleagues, I especially thank Bob 
Carmichael for painstakingly listing many corrections and Ray Brown for posing 
important questions. Constructive criticisms and useful suggestions for additions 
and changes to the individual revised chapters in this edition were made by Mark 
Bukowinski, Clark Wilson, Doug Christensen, Jim Dewey, Henry Pollack, 
Ladislaus Rybach, Chris Heinrich, Hans-Ruedi Maurer and Mike Fuller. I am very 
grateful to these colleagues for the time they expended and their unselfish efforts to 
help me. If errors persist in this edition, it is not their fault but due to my negligence. 

The publisher of this textbook, Cambridge University Press, is a not-for-profit 
charitable institution. One of their activities is to promote academic literature in the 
“third world.” With my agreement, they decided to publish a separate low-cost 
version of the first edition, for sale only in developing countries. This version 
accounted for about one-third of the sales of the first edition. As a result, earth 
science students in developing countries could be helped in their studies of geo¬ 
physics; several sent me appreciative messages, which I treasure. 

The bulk of this edition has been written following my retirement two years ago, 
after 30 years as professor of geophysics at ETH Zurich. My new emeritus status 
should have provided lots of time for the project, but somehow it took longer than I 
expected. My wife Marcia exhibited her usual forbearance and understanding for 
my obsession. I thank her for her support, encouragement and practical sugges¬ 
tions, which have been as important for this as for the first edition. This edition is 
dedicated to her, as well as to my late parents. 

William Lowrie 
Zurich 
August, 2006 
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1 The Earth as a planet 


1.1 THE SOLAR SYSTEM 

1.1.1 The discovery and description of the planets 

To appreciate how impressive the night sky must have 
been to early man it is necessary today to go to a place 
remote from the distracting lights and pollution of urban 
centers. Viewed from the wilderness the firmaments 
appear to the naked eye as a canopy of shining points, 
fixed in space relative to each other. Early observers noted 
that the star pattern appeared to move regularly and used 
this as a basis for determining the timing of events. More 
than 3000 years ago, in about the thirteenth century 
BC, the year and month were combined in a working cal¬ 
endar by the Chinese, and about 350 BC the Chinese 
astronomer Shih Shen prepared a catalog of the positions 
of 800 stars. The ancient Greeks observed that several 
celestial bodies moved back and forth against this fixed 
background and called them the planetes, meaning “wan¬ 
derers.” In addition to the Sun and Moon, the naked eye 
could discern the planets Mercury, Venus, Mars, Jupiter 
and Saturn. 

Geometrical ideas were introduced into astronomy by 
the Greek philosopher Thales in the sixth century BC. This 
advance enabled the Greeks to develop astronomy to its 
highest point in the ancient world. Aristotle (384—322 BC) 
summarized the Greek work performed prior to his time 
and proposed a model of the universe with the Earth at its 
center. This geocentric model became imbedded in reli¬ 
gious conviction and remained in authority until late into 
the Middle Ages. It did not go undisputed; Aristarchus of 
Samos (c.310-c.230 BC) determined the sizes and dis¬ 
tances of the Sun and Moon relative to the Earth and 
proposed a heliocentric (sun-centered) cosmology. The 
methods of trigonometry developed by Hipparchus 
(190-120 BC) enabled the determination of astronomical 
distances by observation of the angular positions of celes¬ 
tial bodies. Ptolemy, a Greco-Egyptian astronomer in the 
second century AD, applied these methods to the known 
planets and was able to predict their motions with remark¬ 
able accuracy considering the primitiveness of available 
instrumentation. 

Until the invention of the telescope in the early seven¬ 
teenth century the main instrument used by astronomers 
for determining the positions and distances of heavenly 
bodies was the astrolabe. This device consisted of a disk 
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Fig. 1.1 Illustration of the method of parallax in which two measured 
angles (0, and 0 2 ) are used to compute the distances (p, and p 2 ) of a 
planet from the Earth in terms of the Earth-Sun distance (s). 

of wood or metal with the circumference marked off in 
degrees. At its center was pivoted a movable pointer 
called the alidade. Angular distances could be deter¬ 
mined by sighting on a body with the alidade and reading 
off its elevation from the graduated scale. The inventor of 
the astrolabe is not known, but it is often ascribed to 
Hipparchus (190-120 BC). It remained an important tool 
for navigators until the invention of the sextant in the 
eighteenth century. 

The angular observations were converted into dis¬ 
tances by applying the method of parallax. This is simply 
illustrated by the following example. Consider the planet 
P as viewed from the Earth at different positions in the 
latter’s orbit around the Sun (Fig. 1.1). For simplicity, 
treat planet P as a stationary object (i.e., disregard the 
planet’s orbital motion). The angle between a sighting on 
the planet and on a fixed star will appear to change 
because of the Earth’s orbital motion around the Sun. 
Let the measured extreme angles be 9 1 and 0 2 and the 
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distance of the Earth from the Sun be 5 ; the distance 
between the extreme positions E and E' of the orbit is 
then 2s. The distances p, and p 2 of the planet from the 
Earth are computed in terms of the Earth-Sun distance 
by applying the trigonometric law of sines: 

Pi sin(90 — 0 2 ) cosd 2 
2s sin(0,+0,) sin(0, + d 2 ) 

P2 _ cose, < u > 

2s sintPj + Sj) 

Further trigonometric calculations give the distances 
of the planets from the Sun. The principle of parallax 
was also used to determine relative distances in the 
Aristotelian geocentric system, according to which the 
fixed stars, Sun, Moon and planets are considered to be in 
motion about the Earth. 

In 1543, the year of his death, the Polish astronomer 
Nicolas Copernicus published a revolutionary work in 
which he asserted that the Earth was not the center of the 
universe. According to his model the Earth rotated about 
its own axis, and it and the other planets revolved about 
the Sun. Copernicus calculated the sidereal period of each 
planet about the Sun; this is the time required for a planet 
to make one revolution and return to the same angular 
position relative to a fixed star. He also determined the 
radii of their orbits about the Sun in terms of the 
Earth-Sun distance. The mean radius of the Earth’s orbit 
about the Sun is called an astronomical unit ; it equals 
149,597,871km. Accurate values of these parameters 
were calculated from observations compiled during an 
interval of 20 years by the Danish astronomer Tycho 
Brahe (1546-1601). On his death the records passed to his 
assistant, Johannes Kepler (1571-1630). Kepler suc¬ 
ceeded in fitting the observations into a heliocentric model 
for the system of known planets. The three laws in which 
Kepler summarized his deductions were later to prove 
vital to Isaac Newton for verifying the law of Universal 
Gravitation. It is remarkable that the database used by 
Kepler was founded on observations that were unaided by 
the telescope, which was not invented until early in the sev¬ 
enteenth century. 

1.1.2 Kepler's laws of planetary motion 

Kepler took many years to fit the observations of Tycho 
Brahe into three laws of planetary motion. The first and 
second laws (Fig. 1.2) were published in 1609 and the 
third law appeared in 1619. The laws may be formulated 
as follows: 

(1) the orbit of each planet is an ellipse with the Sun at 
one focus; 

(2) the orbital radius of a planet sweeps out equal areas 
in equal intervals of time; 

(3) the ratio of the square of a planet’s period ( T 2 ) to the 
cube of the semi-major axis of its orbit (a 3 ) is a con¬ 
stant for all the planets, including the Earth. 



Fig. 1.2 Kepler's first two laws of planetary motion: (1) each planetary 
orbit is an ellipse with the Sun at one focus, and (2) the radius to a 
planet sweeps out equal areas in equal intervals of time. 


Kepler’s three laws are purely empirical, derived from 
accurate observations. In fact they are expressions of 
more fundamental physical laws. The elliptical shapes of 
planetary orbits (Box 1.1) described by the first law are a 
consequence of the conservation of energy of a planet 
orbiting the Sun under the effect of a central attraction 
that varies as the inverse square of distance. The second 
law describing the rate of motion of the planet around its 
orbit follows directly from the conservation of angular 
momentum of the planet. The third law results from the 
balance between the force of gravitation attracting the 
planet towards the Sun and the centrifugal force away 
from the Sun due to its orbital speed. The third law is 
easily proved for circular orbits (see Section 2.3.2.3). 

Kepler’s laws were developed for the solar system but 
are applicable to any closed planetary system. They govern 
the motion of any natural or artificial satellite about a 
parent body. Kepler’s third law relates the period ( T) and 
the semi-major axis (a) of the orbit of the satellite to the 
mass (M) of the parent body through the equation 



where G is the gravitational constant. This relationship 
was extremely important for determining the masses of 
those planets that have natural satellites. It can now be 
applied to determine the masses of planets using the 
orbits of artificial satellites. 

Special terms are used in describing elliptical orbits. 
The nearest and furthest points of a planetary orbit 
around the Sun are called perihelion and aphelion , respec¬ 
tively. The terms perigee and apogee refer to the corre¬ 
sponding nearest and furthest points of the orbit of the 
Moon or a satellite about the Earth. 

1.1.3 Characteristics of the planets 

Galileo Galilei (1564-1642) is often regarded as a founder 
of modern science. He made fundamental discoveries in 
astronomy and physics, including the formulation of the 
laws of motion. He was one of the first scientists to use 
the telescope to acquire more detailed information about 
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Box 1.1: Orbital 

The orbit of a planet or comet in the solar system is an 
ellipse with the Sun at one of its focal points. This con¬ 
dition arises from the conservation of energy in a force 
field obeying an inverse square law. The total energy 
( E) of an orbiting mass is the sum of its kinetic energy 
(K) and potential energy ({/). For an object with mass 
m and velocity v in orbit at distance r from the Sun 
(mass S ) 

^ mv 2 — G yr = E = constant (1) 

If the kinetic energy is greater than the potential 
energy of the gravitational attraction to the Sun (£>0), 
the object will escape from the solar system. Its path is a 
hyperbola. The same case results if E = 0, but the path is 
a parabola. If E< 0, the gravitational attraction binds 
the object to the Sun; the path is an ellipse with the Sun 
at one focal point (Fig. Bl.1.1). An ellipse is defined as 
the locus of all points in a plane whose distances Sj and 
s 2 from two fixed points Fj and F 2 in the plane have a 
constant sum, defined as 2a: 

s i + s 2~ 2a (2) 

The distance 2 a is the length of the major axis of the 
ellipse; the minor axis perpendicular to it has length 2b, 
which is related to the major axis by the eccentricity of 
the ellipse, e: 


e =^F-f’ ® 

The equation of a point on the ellipse with Cartesian 
coordinates (x, y ) defined relative to the center of the 
figure is 


The elliptical orbit of the Earth around the Sun 
defines the ecliptic plane. The angle between the orbital 
plane and the ecliptic is called the inclination of the 
orbit, and for most planets except Mercury (inclination 
7°) and Pluto (inclination 17°) this is a small angle. A 
line perpendicular to the ecliptic defines the North and 
South ecliptic poles. If the fingers of one’s right hand 
are wrapped around Earth’s orbit in the direction of 
motion, the thumb points to the North ecliptic pole, 
which is in the constellation Draco (“the dragon”). 
Viewed from above this pole, all planets move around 
the Sun in a counterclockwise (prograde) sense. 



Fig. Bl.1.1 The parameters of an elliptical orbit. 


Pole to 
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summer 
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Fig. B1.1.2 The relationship between the ecliptic plane, Earth's 
equatorial plane and the line of equinoxes. 


The rotation axis of the Earth is tilted away from 
the perpendicular to the ecliptic forming the angle of 
obliquity (Fig. Bl.l.2), which is currently 23.5°. The 
equatorial plane is tilted at the same angle to the eclip¬ 
tic, which it intersects along the line of equinoxes. 
During the annual motion of the Earth around the Sun, 
this line twice points to the Sun: on March 20, defining 
the vernal (spring) equinox, and on September 23, 
defining the autumnal equinox. On these dates day and 
night have equal length everywhere on Earth. The 
summer and winter solstices occur on June 21 and 
December 22, respectively, when the apparent motion of 
the Sun appears to reach its highest and lowest points in 
the sky. 


the planets. In 1610 Galileo discovered the four largest 
satellites of Jupiter (called Io, Europa, Ganymede and 
Callisto), and observed that (like the Moon) the planet 
Venus exhibited different phases of illumination, from full 


disk to partial crescent. This was persuasive evidence in 
favor of the Copernican view of the solar system. 

In 1686 Newton applied his theory of Universal 
Gravitation to observations of the orbit of Callisto and 
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Table 1.1 Dimensions and rotational characteristics of the planets (data sources: Beatty et al., 1999; McCarthy and Petit, 
2004; National Space Science Data Center, 2004 [http:llnssdc.gsfc.nasa.govlplaneta.ryl]) 

The great planets and Pluto are gaseous. For these planets the surface on which the pressure is 1 atmosphere is taken as the 
effective radius. In the definition of polar flattening, a and c are respectively the semi-major and semi-minor axes of the 
spheroidal shape. 


Planet 

Mass 

A/[10 24 kg] 

Mass 
relative 
to Earth 

Mean 

density 

[kgm~ 3 ] 

Equatorial 
radius [km] 

Sidereal 
rotation 
period [days] 

Polar 
flattening 
f—{a — c)la 

Obliquity 
of rotation 
axis [°] 

Terrestrial planets and the Moon 
Mercury 0.3302 

0.0553 

5,427 

2,440 

58.81 

0.0 

0.1 

Venus 

4.869 

0.815 

5,243 

6,052 

243.7 

0.0 

177.4 

Earth 

5.974 

1.000 

5,515 

6,378 

0.9973 

0.003353 

23.45 

Moon 

0.0735 

0.0123 

3,347 

1,738 

27.32 

0.0012 

6.68 

Mars 

0.6419 

0.1074 

3,933 

3,397 

1.0275 

0.00648 

25.19 

Great planets and Pluto 

Jupiter 

1,899 

317.8 

1,326 

71,492 

0.414 

0.0649 

3.12 

Saturn 

568.5 

95.2 

687 

60,268 

0.444 

0.098 

26.73 

Uranus 

86.8 

14.4 

1,270 

25,559 

0.720 

0.023 

97.86 

Neptune 

102.4 

17.15 

1,638 

24,766 

0.671 

0.017 

29.6 

Pluto 

0.125 

0.0021 

1,750 

1,195 

6.405 

— 

122.5 


calculated the mass of Jupiter (J) relative to that of the 
Earth ( E). The value of the gravitational constant G was 
not yet known; it was first determined by Lord Cavendish 
in 1798. However, Newton calculated the value of GJ to 
be 124,400,000 km 3 s -2 . This was a very good determina¬ 
tion; the modern value for GJ is 126,712,767 km 3 s -2 . 
Observations of the Moon's orbit about the Earth 
showed that the value GE was 398,600 km 3 s -2 . Hence 
Newton inferred the mass of Jupiter to be more than 300 
times that of the Earth. 

In 1781 William Herschel discovered Uranus, the first 
planet to be found by telescope. The orbital motion of 
Uranus was observed to have inconsistencies, and it was 
inferred that the anomalies were due to the perturbation 
of the orbit by a yet undiscovered planet. The predicted 
new planet, Neptune, was discovered in 1846. Although 
Neptune was able to account for most of the anomalies of 
the orbit of Uranus, it was subsequently realized that 
small residual anomalies remained. In 1914 Percival 
Lowell predicted the existence of an even more distant 
planet, the search for which culminated in the detection of 
Pluto in 1930. 

The masses of the planets can be determined by apply¬ 
ing Kepler’s third law to the observed orbits of natural 
and artificial satellites and to the tracks of passing space¬ 
craft. Estimation of the sizes and shapes of the planets 
depends on data from several sources. Early astronomers 
used occultations of the stars by the planets; an occupa¬ 
tion is the eclipse of one celestial body by another, such as 
when a planet passes between the Earth and a star. The 
duration of an occupation depends on the diameter of the 
planet, its distance from the Earth and its orbital speed. 

The dimensions of the planets (Table 1.1) have been 
determined with improved precision in modern times by the 


availability of data from spacecraft, especially from radar¬ 
ranging and Doppler tracking (see Box 1.2). Radar-ranging 
involves measuring the distance between an orbiting (or 
passing) spacecraft and the planet’s surface from the two- 
way travel-time of a pulse of electromagnetic waves in the 
radar frequency range. The separation can be measured 
with a precision of a few centimeters. If the radar signal is 
reflected from a planet that is moving away from the space¬ 
craft the frequency of the reflection is lower than that of the 
transmitted signal; the opposite effect is observed when the 
planet and spacecraft approach each other. The Doppler 
frequency shift yields the relative velocity of the spacecraft 
and planet. Together, these radar methods allow accurate 
determination of the path of the spacecraft, which is 
affected by the mass of the planet and the shape of its grav¬ 
itational equipotential surfaces (see Section 2.2.3). 

The rate of rotation of a planet about its own axis can 
be determined by observing the motion of features on its 
surface. Where this is not possible (e.g., the surface of 
Uranus is featureless) other techniques must be 
employed. In the case of Uranus the rotational period of 
17.2 hr was determined from periodic radio emissions 
produced by electrical charges trapped in its magnetic 
field; they were detected by the Voyager 2 spacecraft 
when it flew by the planet in 1986. All planets revolve 
around the Sun in the same sense, which is counterclock¬ 
wise when viewed from above the plane of the Earth’s 
orbit (called the ecliptic plane). Except for Pluto, the 
orbital plane of each planet is inclined to the ecliptic at a 
small angle (Table 1.2). Most of the planets rotate about 
their rotation axis in the same sense as their orbital 
motion about the Sun, which is termed prograde. Venus 
rotates in the opposite, retrograde sense. The angle 
between a rotation axis and the ecliptic plane is called the 
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Box 1.2: Radar and the Doppler effect 


The name radar derives from the acronym for RA dio 
Detection And Ranging, a defensive system developed 
during World War II for the location of enemy aircraft. 
An electromagnetic signal in the microwave frequency 
range (see Fig. 4.59), consisting of a continuous wave or 
a series of short pulses, is transmitted toward a target, 
from which a fraction of the incident energy is reflected 
to a receiver. The laws of optics for visible light apply 
equally to radar waves, which are subject to reflection, 
refraction and diffraction. Visible light has short wave¬ 
lengths (400-700 nm) and is scattered by the atmos¬ 
phere, especially by clouds. Radar signals have longer 
wavelengths (~lcm to 30 cm) and can pass through 
clouds and the atmosphere of a planet with little disper¬ 
sion. The radar signal is transmitted in a narrow beam 
of known azimuth, so that the returning echo allows 
exact location of the direction to the target. The signal 
travels at the speed of light so the distance, or range, to 
the target may be determined from the time difference at 
the source between the transmitted and reflected signals. 

The transmitted and reflected radar signals lose 
energy in transit due to atmospheric absorption, but 
more importantly, the amplitude of the reflected signal is 
further affected by the nature of the reflecting surface. 
Each part of the target’s surface illuminated by the radar 
beam contributes to the reflected signal. If the surface is 
inclined obliquely to the incoming beam, little energy 
will reflect back to the source. The reflectivity and rough¬ 
ness of the reflecting surface determine how much of the 
incident energy is absorbed or scattered. The intensity of 
the reflected signal can thus be used to characterize the 
type and orientation of the reflecting surface, e.g., 
whether it is bare or forested, flat or mountainous. 

The Doppler effect, first described in 1842 by an 
Austrian physicist, Christian Doppler, explains how the 
relative motion between source and detector influences 
the observed frequency of light and sound waves. For 


example, suppose a stationary radar source emits a 
signal consisting of n 0 pulses per second. The frequency 
of pulses reflected from a stationary target at distance d 
is also n 0 , and the two-way travel-time of each pulse is 
equal to 2 (die), where c is the speed of light. If the target 
is moving toward the radar source, its velocity v shortens 
the distance between the radar source and the target by 
(vtIT), where t is the new two-way travel-time: 


,7 F-<."" 2 > L,„- 


t = 2 




( 1 ) 


t = t 0 /( 1 + v/c) 


( 2 ) 


The travel-time of each reflected pulse is shortened, 
so the number of reflected pulses (n) received per second 
is correspondingly higher than the number emitted: 


n = « 0 ( 1 + v/c) 


(3) 


The opposite situation arises if the target is moving 
away from the source: the frequency of the reflected 
signal is lower than that emitted. Similar principles 
apply if the radar source is mounted on a moving plat¬ 
form, such as an aircraft or satellite. The Doppler 
change in signal frequency in each case allows remote 
measurement of the relative velocity between an object 
and a radar transmitter. 

In another important application, the Doppler effect 
provides evidence that the universe is expanding. The 
observed frequency of light from a star (i.e., its color) 
depends on the velocity of its motion relative to an 
observer on Earth. The color of the star shifts toward 
the red end of the spectrum (lower frequency) if the star 
is receding from Earth and toward the blue end (higher 
frequency) if it is approaching Earth. The color of light 
from many distant galaxies has a “red shift,” implying 
that these galaxies are receding from the Earth. 


obliquity of the axis. The rotation axes of Uranus and 
Pluto lie close to their orbital planes; they are tilted away 
from the pole to the orbital plane at angles greater than 
90°, so that, strictly speaking, their rotations are also 
retrograde. 

The relative sizes of the planets are shown in Fig. 1.3. 
They form three categories on the basis of their physical 
properties (Table 1.1). The terrestrial planets (Mercury, 
Venus, Earth and Mars) resemble the Earth in size and 
density. They have a solid, rocky composition and they 
rotate about their own axes at the same rate or slower 
than the Earth. The great, or Jovian, planets (Jupiter, 
Saturn, Uranus and Neptune) are much larger than the 
Earth and have much lower densities. Their compositions 
are largely gaseous and they rotate more rapidly than the 
Earth. Pluto’s large orbit is highly elliptical and more 


steeply inclined to the ecliptic than that of any other 
planet. Its physical properties are different from both the 
great planets and the terrestrial planets. These nine bodies 
are called the major planets. There are other large objects 
in orbit around the Sun, called minor planets, which do 
not fulfil the criteria common to the definition of the 
major planets. The discovery of large objects in the solar 
system beyond the orbit of Neptune has stimulated 
debate among astronomers about what these criteria 
should be. As a result, Pluto has been reclassified as a 
“ dwarf planet .” 

1.1.3.1 Bode'slaw 

In 1772 the German astronomer Johann Bode devised an 
empirical formula to express the approximate distances of 
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Table 1.2 Dimensions and characteristics of the planetary orbits (data sources: Beatty et al., 1999; McCarthy and Petit, 
2004; National Space Science Data Center, 2004 [http:llnssdc.gsfc.nasa.gov/planetaryl]) 


Planet 

Mean 

orbital radius 
[AU] 

Semi-major 
axis [10 6 km] 

Eccentricity 
of orbit 

Inclination 
of orbit to 
ecliptic [°] 

Mean orbital 
velocity 
[km s- 1 ] 

Sidereal 
period of 
orbit [yr] 

Terrestrial planets and the Moon 
Mercury 0.3830 

57.91 

0.2056 

7.00 

47.87 

0.2408 

Venus 

0.7234 

108.2 

0.0068 

3.39 

35.02 

0.6152 

Earth 

1.0000 

149.6 

0.01671 

0.0 

29.79 

1.000 

Moon 

0.00257 

0.3844 

0.0549 

5.145 

1.023 

0.0748 

(about Earth) 
Mars 

1.520 

227.9 

0.0934 

1.85 

24.13 

1.881 

Great planets and Pluto 

Jupiter 5.202 

778.4 

0.0484 

1.305 

13.07 

11.86 

Saturn 

9.576 

1,427 

0.0542 

2.484 

9.69 

29.4 

Uranus 

19.19 

2,871 

0.0472 

0.77 

6.81 

83.7 

Neptune 

30.07 

4,498 

0.00859 

1.77 

5.43 

164.9 

Pluto 

38.62 

5,906 

0.249 

17.1 

4.72 

248 


Mercury Venus Earth Mars 

o o • ° 



Fig. 1.3 The relative sizes of the planets: (a) the terrestrial planets, (b) 
the great (Jovian) planets and (c) Pluto, which is diminutive compared to 
the others. 

the planets from the Sun. A series of numbers is created in 
the following way: the first number is zero, the second is 
0.3, and the rest are obtained by doubling the previous 
number. This gives the sequence 0, 0.3, 0.6, 1.2, 2.4, 4.8, 

9.6, 19.2, 38.4, 76.8, etc. Each number is then augmented 
by 0.4 to give the sequence: 0.4,0.7,1.0,1.6,2.8, 5.2,10.0, 

19.6, 38.8, 77.2, etc. This series can be expressed mathe¬ 
matically as follows: 

d n = 0.4 for n = 1 

d n = 0.4 + 0.3 X 2 n ~ 2 for « > 2 (L3) 

This expression gives the distance d n in astronomical 
units (AU) of the nth planet from the Sun. It is usually 
known as Bode’s law but, as the same relationship had 
been suggested earlier by J. D. Titius of Wittenberg, it is 
sometimes called Titius-Bode's law. Examination of Fig. 
1.4 and comparison with Table 1.2 show that this rela¬ 
tionship holds remarkably well, except for Neptune and 
Pluto. A possible interpretation of the discrepancies is 


100 n 




0 . 1-1 -.. . .■. . .■. . 

0.1 1 10 100 
Distance from Sun (AU) 
predicted by Bodes law 

Fig. 1.4 Bode's empirical law for the distances of the planets from 
the Sun. 

that the orbits of these planets are no longer their original 
orbits. 

Bode's law predicts a fifth planet at 2.8 AU from the 
Sun, between the orbits of Mars and Jupiter. In the last 
years of the eighteenth century astronomers searched 
intensively for this missing planet. In 1801 a small plane¬ 
toid, Ceres, was found at a distance of 2.77 AU from the 
Sun. Subsequently, it was found that numerous small 
planetoids occupied a broad band of solar orbits centered 
about 2.9 AU, now called the asteroid belt. Pallas was 
found in 1802, Juno in 1804, and Vesta, the only asteroid 
that can be seen with the naked eye, was found in 1807. By 
1890 more than 300 asteroids had been identified. In 1891 
astronomers began to record their paths on photographic 
plates. Thousands of asteroids occupying a broad belt 
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between Mars and Jupiter, at distances of 2.15-3.3 AU 
from the Sun, have since been tracked and cataloged. 

Bode’s law is not a true law in the scientific sense. It 
should be regarded as an intriguing empirical relation¬ 
ship. Some astronomers hold that the regularity of the 
planetary distances from the Sun cannot be mere chance 
but must be a manifestation of physical laws. However, 
this may be wishful thinking. No combination of physical 
laws has yet been assembled that accounts for Bode’s law. 

1.1.3.2 The terrestrial planets and the Moon 

Mercury is the closest planet to the Sun. This proximity 
and its small size make it difficult to study telescopically. 
Its orbit has a large eccentricity (0.206). At perihelion the 
planet comes within 46.0 million km (0.313AU) of the 
Sun, but at aphelion the distance is 69.8 million km 
(0.47 AU). Until 1965 the rotational period was thought 
to be the same as the period of revolution (88 days), so 
that it would keep the same face to the Sun, in the same 
way that the Moon does to the Earth. However, in 1965 
Doppler radar measurements showed that this is not the 
case. In 1974 and 1975 images from the close passage of 
Mariner 10, the only spacecraft to have visited the planet, 
gave a period of rotation of 58.8 days, and Doppler track¬ 
ing gave a radius of 2439 km. 

The spin and orbital motions of Mercury are both 
prograde and are coupled in the ratio 3:2. The spin period 
is 58.79 Earth days, almost exactly 2/3 of its orbital 
period of 87.97 Earth days. For an observer on the planet 
this has the unusual consequence that a Mercury day lasts 
longer than a Mercury year! During one orbital revolu¬ 
tion about the Sun (one Mercury year) an observer on the 
surface rotates about the spin axis 1.5 times and thus 
advances by an extra half turn. If the Mercury year 
started at sunrise, it would end at sunset, so the observer 
on Mercury would spend the entire 88 Earth days 
exposed to solar heating, which causes the surface tem¬ 
perature to exceed 700 K. During the following Mercury 
year, the axial rotation advances by a further half-turn, 
during which the observer is on the night side of the 
planet for 88 days, and the temperature sinks below 
100 K. After 2 solar orbits and 3 axial rotations, the 
observer is back at the starting point. The range of tem¬ 
peratures on the surface of Mercury is the most extreme 
in the solar system. 

Although the mass of Mercury is only about 5.5% that 
of the Earth, its mean density of 5427 kgm“ 3 is compara¬ 
ble to that of the Earth (5515kgm~ 3 ) and is the second 
highest in the solar system. This suggests that, like Earth, 
Mercury’s interior is dominated by a large iron core, 
whose radius is estimated to be about 1800-1900 km. It is 
enclosed in an outer shell 500-600 km thick, equivalent to 
Earth’s mantle and crust. The core may be partly molten. 
Mercury has a weak planetary magnetic field. 

Venus is the brightest object in the sky after the Sun and 
Moon. Its orbit brings it closer to Earth than any other 


planet, which made it an early object of study by tele¬ 
scope. Its occultation with the Sun was observed telescop¬ 
ically as early as 1639. Estimates of its radius based on 
occultations indicated about 6120 km. Galileo observed 
that the apparent size of Venus changed with its position 
in orbit and, like the Moon, the appearance of Venus 
showed different phases from crescent-shaped to full. This 
was important evidence in favor of the Copernican helio¬ 
centric model of the solar system, which had not yet 
replaced the Aristotelian geocentric model. 

Venus has the most nearly circular orbit of any planet, 
with an eccentricity of only 0.007 and mean radius of 
0.72 AU (Table 1.2). Its orbital period is 224.7 Earth days, 
and the period of rotation about its own axis is 243.7 
Earth days, longer than the Venusian year. Its spin axis is 
tilted at 177° to the pole to the ecliptic, thus making its 
spin retrograde. The combination of these motions results 
in the length of a Venusian day (the time between succes¬ 
sive sunrises on the planet) being equal to about 117 
Earth days. 

Venus is very similar in size and probable composition 
to the Earth. During a near-crescent phase the planet is 
ringed by a faint glow indicating the presence of an 
atmosphere. This has been confirmed by several space¬ 
craft that have visited the planet since the first visit by 
Mariner 2 in 1962. The atmosphere consists mainly of 
carbon dioxide and is very dense; the surface atmospheric 
pressure is 92 times that on Earth. Thick cloud cover 
results in a strong greenhouse effect that produces stable 
temperatures up to 740 K, slightly higher than the 
maximum day-time values on Mercury, making Venus the 
hottest of the planets. The thick clouds obscure any view 
of the surface, which has however been surveyed with 
radar. The Magellan spacecraft, which was placed in a 
nearly polar orbit around the planet in 1990, carried a 
radar-imaging system with an optimum resolution of 100 
meters, and a radar altimeter system to measure the 
topography and some properties of the planet’s surface. 

Venus is unique among the planets in rotating in a ret¬ 
rograde sense about an axis that is almost normal to the 
ecliptic (Table 1.1). Like Mercury, it has a high Earth-like 
density (5243 kg m -3 ). On the basis of its density together 
with gravity estimates from Magellan’s orbit, it is thought 
that the interior of Venus may be similar to that of Earth, 
with a rocky mantle surrounding an iron core about 
3000 km in radius, that is possibly at least partly molten. 
However, in contrast to the Earth, Venus has no 
detectable magnetic field. 

The Earth moves around the Sun in a slightly elliptical 
orbit. The parameters of the orbital motion are impor¬ 
tant, because they define astronomical units of distance 
and time. The Earth’s rotation about its own axis from one 
solar zenith to the next one defines the solar day (see 
Section 4.1.1.2). The length of time taken for it to com¬ 
plete one orbital revolution about the Sun defines the solar 
year, which is equal to 365.242 solar days. The eccentricity 
of the orbit is presently 0.01671 but it varies between a 
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minimum of 0.001 and a maximum of 0.060 with a period 
of about 100,000 yr due to the influence of the other 
planets. The mean radius of the orbit (149,597,871 km) is 
called an astronomical unit (AU). Distances within the 
solar system are usually expressed as multiples of this 
unit. The distances to extra-galactic celestial bodies are 
expressed as multiples of a light-year (the distance trav¬ 
elled by light in one year). The Sun's light takes about 
8 min 20 s to reach the Earth. Owing to the difficulty of 
determining the gravitational constant the mass of the 
Earth ( E) is not known with high precision, but is esti¬ 
mated to be 5.9737 X 10 24 kg. In contrast, the product GE 
is known accurately; it is equal to 3.986004418 X 10 14 
m 3 s -2 . The rotation axis of the Earth is presently inclined 
at 23.439° to the pole of the ecliptic. However, the effects 
of other planets also cause the angle of obliquity to vary 
between a minimum of 21.9° and a maximum of 24.3°, 
with a period of about 41,000 yr. 

The Moon is Earth’s only natural satellite. The dis¬ 
tance of the Moon from the Earth was first estimated 
with the method of parallax. Instead of observing the 
Moon from different positions of the Earth’s orbit, as 
shown in Fig. 1.1, the Moon’s position relative to a fixed 
star was observed at times 12 hours apart, close to moon- 
rise and moonset, when the Earth had rotated through 
half a revolution. The baseline for the measurement is 
then the Earth’s diameter. The distance of the Moon from 
the Earth was found to be about 60 times the Earth’s 
radius. 

The Moon rotates about its axis in the same sense as its 
orbital revolution about the Earth. Tidal friction resulting 
from the Earth’s attraction has slowed down the Moon’s 
rotation, so that it now has the same mean period as its rev¬ 
olution, 27.32 days. As a result, the Moon always presents 
the same face to the Earth. In fact, slightly more than half 
(about 59%) of the lunar surface can be viewed from the 
Earth. Several factors contribute to this. First, the plane of 
the Moon’s orbit around the Earth is inclined at 5°9' to the 
ecliptic while the Moon’s equator is inclined at 1°32' to the 
ecliptic. The inclination of the Moon's equator varies by 
up to 6°4T to the plane of its orbit. This is called the libra- 
tion of latitude. It allows Earth-based astronomers to see 
6°4T beyond each of the Moon’s poles. Secondly, the 
Moon moves with variable velocity around its elliptical 
orbit, while its own rotation is constant. Near perigee the 
Moon’s orbital velocity is fastest (in accordance with 
Kepler’s second law) and the rate of revolution exceeds 
slightly the constant rate of lunar rotation. Similarly, near 
apogee the Moon’s orbital velocity is slowest and the rate 
of revolution is slightly less than the rate of rotation. The 
rotational differences are called the Moon’s libration of lon¬ 
gitude. Their effect is to expose zones of longitude beyond 
the average edges of the Moon. Finally, the Earth’s diame¬ 
ter is quite large compared to the Moon’s distance from 
Earth. During Earth’s rotation the Moon is viewed from 
different angles that allow about one additional degree of 
longitude to be seen at the Moon’s edge. 


The distance to the Moon and its rotational rate are 
well known from laser-ranging using reflectors placed on 
the Moon by astronauts. The accuracy of laser-ranging is 
about 2-3 cm. The Moon has a slightly elliptical orbit 
about the Earth, with eccentricity 0.0549 and mean 
radius 384,100 km. The Moon’s own radius of 1738 km 
makes it much larger relative to its parent body than the 
natural satellites of the other planets except for Pluto’s 
moon, Charon. Its low density of 3347 kgm -3 may be due 
to the absence of an iron core. The internal composition 
and dynamics of the Moon have been inferred from 
instruments placed on the surface and rocks recovered 
from the Apollo and Luna manned missions. Below a 
crust that is on average 68 km thick the Moon has a 
mantle and a small core about 340 km in radius. In con¬ 
trast to the Earth, the interior is not active, and so the 
Moon does not have a global magnetic field. 

Mars, popularly called the red planet because of its hue 
when viewed from Earth, has been known since prehistoric 
times and was also an object of early telescopic study. In 
1666 Gian Domenico Cassini determined the rotational 
period at just over 24 hr; radio-tracking from two Viking 
spacecraft that landed on Mars in 1976, more than three 
centuries later, gave a period of 24.623 hr. The orbit of 
Mars is quite elliptical (eccentricity 0.0934). The large 
difference between perihelion and aphelion causes large 
temperature variations on the planet. The average surface 
temperature is about 218 K, but temperatures range from 
140 K at the poles in winter to 300 K on the day side in 
summer. Mars has two natural satellites, Phobos and 
Deimos. Observations of their orbits gave early estimates 
of the mass of the planet. Its size was established quite 
early telescopically from occultations. Its shape is known 
very precisely from spacecraft observations. The polar flat¬ 
tening is about double that of the Earth. The rotation rates 
of Earth and Mars are almost the same, but the lower mean 
density of Mars results in smaller gravitational forces, so at 
any radial distance the relative importance of the centrifu¬ 
gal acceleration is larger on Mars than on Earth. 

In 2004 the Mars Expedition Rover vehicles Spirit and 
Opportunity landed on Mars, and transmitted pho¬ 
tographs and geological information to Earth. Three 
spacecraft (Mars Globed Surveyor, Mars Odyssey, and 
Mars Express) were placed in orbit to carry out surveys of 
the planet. These and earlier orbiting spacecraft and 
Martian landers have revealed details of the planet that 
cannot be determined with a distant telescope (including 
the Earth-orbiting Hubble telescope). Much of the 
Martian surface is very old and cratered, but there are 
also much younger rift valleys, ridges, hills and plains. 
The topography is varied and dramatic, with mountains 
that rise to 24 km, a 4000 km long canyon system, and 
impact craters up to 2000 km across and 6 km deep. 

The internal structure of Mars can be inferred from 
the results of these missions. Mars has a relatively low 
mean density (3933 kgm -3 ) compared to the other terres¬ 
trial planets. Its mass is only about a tenth that of Earth 
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(Table 1.1), so the pressures in the planet are lower and 
the interior is less densely compressed. Mars has an inter¬ 
nal structure similar to that of the Earth. A thin crust, 
35 km thick in the northern hemisphere and 80 km thick 
in the southern hemisphere, surrounds a rocky mantle 
whose rigidity decreases with depth as the internal 
temperature increases. The planet has a dense core 
1500-1800 km in radius, thought to be composed of iron 
with a relatively large fraction of sulfur. Minute perturba¬ 
tions of the orbit of Mars Global Surveyor, caused by 
deformations of Mars due to solar tides, have provided 
more detailed information about the internal structure. 
They indicate that, like the Earth, Mars probably has a 
solid inner core and a fluid outer core that is, however, too 
small to generate a global magnetic field. 

The Asteroids occur in many sizes, ranging from several 
hundred kilometers in diameter, down to bodies that are 
too small to discern from Earth. There are 26 asteroids 
larger than 200 km in diameter, but there are probably 
more than a million with diameters around 1 km. Some 
asteroids have been photographed by spacecraft in fly-by 
missions: in 1997 the NEAR-Shoemaker spacecraft 
orbited and landed on the asteroid Eros. Hubble Space 
Telescope imagery has revealed details of Ceres (diameter 
950 km), Pallas (diameter 830 km) and Vesta (diameter 
525 km), which suggest that it may be more appropriate to 
call these three bodies protoplanets (i.e., still in the process 
of accretion from planetesimals) rather than asteroids. All 
three are differentiated and have a layered internal struc¬ 
ture like a planet, although the compositions of the inter¬ 
nal layers are different. Ceres has an oblate spheroidal 
shape and a silicate core, and is the most massive asteroid; 
it has recently been reclassified as a “dwarf planet .” Vesta’s 
shape is more irregular and it has an iron core. 

Asteroids are classified by type, reflecting their composi¬ 
tion (stony carbonaceous or metallic nickel-iron), and by 
the location of their orbits. Main belt asteroids have near¬ 
circular orbits with radii 2-A AU between Mars and Jupiter. 
The Centaur asteroids have strongly elliptical orbits that 
take them into the outer solar system. The Aten and Apollo 
asteroids follow elliptical Earth-crossing orbits. The colli¬ 
sion of one of these asteroids with the Earth would have a 
cataclysmic outcome. A 1 km diameter asteroid would 
create a 10 km diameter crater and release as much energy 
as the simultaneous detonation of most or all of the nuclear 
weapons in the world’s arsenals. In 1980 Luis and Walter 
Alvarez and their colleagues showed on the basis of an 
anomalous concentration of extra-terrestrial iridium at the 
Cretaceous-Tertiary boundary at Gubbio, Italy, that a 
10 km diameter asteroid had probably collided with Earth, 
causing directly or indirectly the mass extinctions of many 
species, including the demise of the dinosaurs. There are 
240 known Apollo bodies; however, there may be as many 
as 2000 that are 1 km in diameter and many thousands 
more measuring tens or hundreds of meters. 

Scientific opinion is divided on what the asteroid belt 
represents. One idea is that it may represent fragments of 


an earlier planet that was broken up in some disaster. 
Alternatively, it may consist of material that was never 
able to consolidate into a planet, perhaps due to the pow¬ 
erful gravitational influence of Jupiter. 

/. 1.3.3 The great planets 

The great planets are largely gaseous, consisting mostly of 
hydrogen and helium, with traces of methane, water and 
solid matter. Their compositions are inferred indirectly 
from spectroscopic evidence, because space probes have 
not penetrated their atmospheres to any great depth. In 
contrast to the rocky terrestrial planets and the Moon, 
the radius of a great planet does not correspond to a solid 
surface, but is taken to be the level that corresponds to a 
pressure of one bar, which is approximately Earth’s 
atmospheric pressure at sea-level. 

Each of the great planets is encircled by a set of con¬ 
centric rings, made up of numerous particles. The rings 
around Saturn, discovered by Galileo in 1610, are the 
most spectacular. For more than three centuries they 
appeared to be a feature unique to Saturn, but in 1977 dis¬ 
crete rings were also detected around Uranus. In 1979 the 
Voyager 1 spacecraft detected faint rings around Jupiter, 
and in 1989 the Voyager 2 spacecraft confirmed that 
Neptune also has a ring system. 

Jupiter has been studied from ground-based observato¬ 
ries for centuries, and more recently with the Hubble 
Space Telescope, but our detailed knowledge of the planet 
comes primarily from unmanned space probes that sent 
photographs and scientific data back to Earth. Between 
1972 and 1977 the planet was visited by the Pioneer 10 and 
11, Voyager 1 and 2, and Ulysses spacecraft. The space¬ 
craft Galileo orbited Jupiter for eight years, from 1995 to 
2003, and sent an instrumental probe into the atmosphere. 
It penetrated to a depth of 140 km before being crushed by 
the atmospheric pressure. 

Jupiter is by far the largest of all the planets. Its mass 
(19 X 10 26 kg) is 318 times that of the Earth (Table 1.1) 
and 2.5 times the mass of all the other planets added 
together (7.7 X 10 26 kg). Despite its enormous size the 
planet has a very low density of only 1326 kg m -3 , from 
which it can be inferred that its composition is domi¬ 
nated by hydrogen and helium. Jupiter has at least 63 
satellites, of which the four largest - Io, Europa, 
Ganymede and Callisto - were discovered in 1610 by 
Galileo. The orbital motions of Io, Europa and 
Ganymede are synchronous, with periods locked in the 
ratio 1:2:4. In a few hundred million years, Callisto will 
also become synchronous with a period 8 times that of 
Io. Ganymede is the largest satellite in the solar system; 
with a radius of 2631km it is slightly larger than the 
planet Mercury. Some of the outermost satellites are less 
than 30 km in radius, revolve in retrograde orbits and 
may be captured asteroids. Jupiter has a system of rings, 
which are like those of Saturn but are fainter and smaller, 
and were first detected during analysis of data from 
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Voyager 1. Subsequently, they were investigated in detail 
during the Galileo mission. 

Jupiter is thought to have a small, hot, rocky core. This 
is surrounded by concentric layers of hydrogen, first in a 
liquid-metallic state (which means that its atoms, although 
not bonded to each other, are so tightly packed that the 
electrons can move easily from atom to atom), then non- 
metallic liquid, and finally gaseous. The planet’s atmos¬ 
phere consists of approximately 86% hydrogen and 14% 
helium, with traces of methane, water and ammonia. The 
liquid-metallic hydrogen layer is a good conductor of elec¬ 
trical currents. These are the source of a powerful mag¬ 
netic field that is many times stronger than the Earth’s and 
enormous in extent. It stretches for several million kilome¬ 
ters toward the Sun and for several hundred million kilo¬ 
meters away from it. The magnetic field traps charged 
particles from the Sun, forming a zone of intense radia¬ 
tion outside Jupiter’s atmosphere that would be fatal to a 
human being exposed to it. The motions of the electric 
charges cause radio emissions. These are modulated by the 
rotation of the planet and are used to estimate the period 
of rotation, which is about 9.9 hr. 

Jupiter’s moon Europa is the focus of great interest 
because of the possible existence of water below its icy 
crust, which is smooth and reflects sunlight brightly. The 
Voyager spacecraft took high-resolution images of the 
moon’s surface, and gravity and magnetic data were 
acquired during close passages of the Galileo spacecraft. 
Europa has a radius of 1565 km, so is only slightly smaller 
than Earth’s Moon, and is inferred to have an iron-nickel 
core within a rocky mantle, and an outer shell of water 
below a thick surface ice layer. 

Saturn is the second largest planet in the solar system. 
Its equatorial radius is 60,268 km and its mean density is 
merely 687 kg m -3 (the lowest in the solar system and less 
than that of water). Thin concentric rings in its equator¬ 
ial plane give the planet a striking appearance. The obliq¬ 
uity of its rotation axis to the ecliptic is 26.7°, similar to 
that of the Earth (Table 1.1). Consequently, as Saturn 
moves along its orbit the rings appear at different angles 
to an observer on Earth. Galileo studied the planet by 
telescope in 1610 but the early instrument could not 
resolve details and he was unable to interpret his observa¬ 
tions as a ring system. The rings were explained by 
Christiaan Huygens in 1655 using a more powerful tele¬ 
scope. In 1675, Domenico Cassini observed that Saturn’s 
rings consisted of numerous small rings with gaps 
between them. The rings are composed of particles of 
ice, rock and debris, ranging in size from dust particles up 
to a few cubic meters, which are in orbit around the 
planet. The origin of the rings is unknown; one theory 
is that they are the remains of an earlier moon that 
disintegrated, either due to an extra-planetary impact or 
as a result of being torn apart by bodily tides caused by 
Saturn’s gravity. 

In addition to its ring system Saturn has more than 30 
moons, the largest of which, Titan, has a radius of 


2575 km and is the only moon in the solar system with a 
dense atmosphere. Observations of the orbit of Titan 
allowed the first estimate of the mass of Saturn to be 
made in 1831. Saturn was visited by the Pioneer 11 space¬ 
craft in 1979 and later by Voyager 1 and Voyager 2. In 
2004 the spacecraft Cassini entered orbit around Saturn, 
and launched an instrumental probe, Huygens, that 
landed on Titan in January 2005. Data from the probe 
were obtained during the descent by parachute through 
Titan’s atmosphere and after landing, and relayed to 
Earth by the orbiting Cassini spacecraft. 

Saturn’s period of rotation has been deduced from 
modulated radio emissions associated with its magnetic 
field. The equatorial zone has a period of 10 hr 14 min, 
while higher latitudes have a period of about 10 hr 39 min. 
The shape of the planet is known from occultations of 
radio signals from the Voyager spacecrafts. The rapid 
rotation and fluid condition result in Saturn having the 
greatest degree of polar flattening of any planet, amount¬ 
ing to almost 10%. Its mean density of 687kgnD 3 is the 
lowest of all the planets, implying that Saturn, like 
Jupiter, is made up mainly of hydrogen and helium and 
contains few heavy elements. The planet probably also 
has a similar layered structure, with rocky core overlain 
successively by layers of liquid-metallic hydrogen and 
molecular hydrogen. However, the gravitational field of 
Jupiter compresses hydrogen to a metallic state, which has 
a high density. This gives Jupiter a higher mean density 
than Saturn. Saturn has a planetary magnetic field that 
is weaker than Jupiter’s but probably originates in the 
same way. 

Uranus is so remote from the Earth that Earth-bound 
telescopic observation reveals no surface features. Until 
the fly-past of Voyager 2 in 1986 much had to be surmised 
indirectly and was inaccurate. Voyager 2 provided detailed 
information about the size, mass and surface of the planet 
and its satellites, and of the structure of the planet’s ring 
system. The planet’s radius is 25,559 km and its mean 
density is 1270kgm -3 . The rotational period, 17.24hr, 
was inferred from periodic radio emissions detected by 
Voyager which are believed to arise from charged particles 
trapped in the magnetic field and thus rotating with the 
planet. The rotation results in a polar flattening of 2.3%. 
Prior to Voyager, there were five known moons. Voyager 
discovered a further 10 small moons, and a further 12 
more distant from the planet have been discovered subse¬ 
quently, bringing the total of Uranus’ known moons to 27. 
The composition and internal structure of Uranus are 
probably different from those of Jupiter and Saturn. The 
higher mean density of Uranus suggests that it contains 
proportionately less hydrogen and more rock and ice. The 
rotation period is too long for a layered structure with 
melted ices of methane, ammonia and water around a 
molten rocky core. It agrees better with a model in which 
heavier materials are less concentrated in a central core, 
and the rock, ices and gases are more uniformly distrib¬ 
uted. 
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Several paradoxes remain associated with Uranus. The 
axis of rotation is tilted at an angle of 98° to the pole to 
the planet's orbit, and thus lies close to the ecliptic plane. 
The reason for the extreme tilt, compared to the other 
planets, is unknown. The planet has a prograde rotation 
about this axis. However, if the other end of the rotation 
axis, inclined at an angle of 82°, is taken as reference, the 
planet's spin can be regarded as retrograde. Both interpre¬ 
tations are equivalent. The anomalous axial orientation 
means that during the 84 years of an orbit round the Sun 
the polar regions as well as the equator experience 
extreme solar insolation. The magnetic field of Uranus is 
also anomalous: it is inclined at a large angle to the rota¬ 
tion axis and its center is displaced axially from the center 
of the planet. 

Neptune is the outermost of the gaseous giant planets. 
It can only be seen from Earth with a good telescope. By 
the early nineteenth century, the motion of Uranus had 
become well enough charted that inconsistencies were 
evident. French and English astronomers independently 
predicted the existence of an eighth planet, and the pre¬ 
dictions led to the discovery of Neptune in 1846. The 
planet had been noticed by Galileo in 1612, but due to its 
slow motion he mistook it for a fixed star. The period of 
Neptune's orbital rotation is almost 165 yr, so the planet 
has not yet completed a full orbit since its discovery. As a 
result, and because of its extreme distance from Earth, 
the dimensions of the planet and its orbit were not well 
known until 1989, when Voyager 2 became the first - and, 
so far, the only - spacecraft to visit Neptune. 

Neptune’s orbit is nearly circular and lies close to the 
ecliptic. The rotation axis has an Earth-like obliquity of 
29.6° and its axial rotation has a period of 16.11 hr, which 
causes a polar flattening of 1.7%. The planet has a radius 
of 24,766km and a mean density of 1638 kgm -3 . The 
internal structure of Neptune is probably like that of 
Uranus: a small rocky core (about the size of planet 
Earth) is surrounded by a non-layered mixture of rock, 
water, ammonia and methane. The atmosphere is pre¬ 
dominantly of hydrogen, helium and methane, which 
absorbs red light and gives the planet its blue color. 

The Voyager 2 mission revealed that Neptune has 13 
moons and a faint ring system. The largest of the moons, 
Triton, has a diameter about 40% of Earth’s and its 
density (2060kgm -3 ) is higher than that of most other 
large moons in the solar system. Its orbit is steeply 
inclined at 157° to Neptune’s equator, making it the only 
large natural satellite in the solar system that rotates 
about its planet in retrograde sense. The moon’s physical 
characteristics, which resemble the planet Pluto, and its 
retrograde orbital motion suggest that Triton was cap¬ 
tured from elsewhere in the outer solar system. 

1.1.3.4 Pluto and the outer solar system 

Until its reclassification in 2006 as a “dwarf planet,” 
Pluto was the smallest planet in the solar system, about 


two-thirds the diameter of Earth’s Moon. It has 
many unusual characteristics. Its orbit has the largest 
inclination to the ecliptic (17.1°) of any major planet and 
it is highly eccentric (0.249), with aphelion at 49.3 AU 
and perihelion at 29.7 AU. This brings Pluto inside 
Neptune’s orbit for 20 years of its 248-year orbital 
period; the paths of Pluto and Neptune do not intersect. 
The orbital period is resonant with that of Neptune in 
the ratio 3:2 (i.e., Pluto’s period is exactly 1.5 times 
Neptune’s). These features preclude any collision bet¬ 
ween the planets. 

Pluto is so far from Earth that it appears only as a 
speck of light to Earth-based telescopes and its surface 
features can be resolved only broadly with the Hubble 
Space Telescope. It is the only planet that has not been 
visited by a spacecraft. It was discovered fortuitously in 
1930 after a systematic search for a more distant planet to 
explain presumed discrepancies in the orbit of Neptune 
which, however, were later found to be due to inaccurate 
estimates of Neptune's mass. The mass and diameter of 
Pluto were uncertain for some decades until in 1978 a 
moon, Charon, was found to be orbiting Pluto at a mean 
distance of 19,600 km. Pluto’s mass is only 0.21% that of 
the Earth. Charon’s mass is about 10-15% of Pluto’s, 
making it the largest moon in the solar system relative to 
its primary planet. The radii of Pluto and Charon are 
estimated from observations with the Hubble Space 
Telescope to be 1137 km and 586 km, respectively, with a 
relative error of about 1%. The mass and diameter of 
Pluto give an estimated density about 2000 kg nr 3 from 
which it is inferred that Pluto’s composition may be a 
mixture of about 70% rock and 30% ice, like that of 
Triton, Neptune’s moon. Charon’s estimated density is 
lower, about 1300kgm -3 , which suggests that there may 
be less rock in its composition. 

Pluto’s rotation axis is inclined at about 122° to its 
orbital plane, so the planet’s axial rotation is retrograde, 
and has a period of 6.387 days. Charon also orbits Pluto 
in a retrograde sense. As a result of tidal forces, Charon’s 
orbital period is synchronous with both its own axial 
rotation and Pluto’s. Thus, the planet and moon con¬ 
stantly present the same face to each other. Because of the 
rotational synchronism and the large relative mass of 
Charon, some consider Pluto-Charon to be a double 
planet. However, this is unlikely because their different 
densities suggest that the bodies originated indepen¬ 
dently. Observations with the Hubble Space Telescope in 
2005 revealed the presence of two other small moons - 
provisionally named 2005 PI and 2005 P2 - in orbit 
around Pluto in the same sense as Charon, but at a larger 
distance of about 44,000 km. All three moons have the 
same color spectrum, which differs from Pluto’s and sug¬ 
gests that the moons were captured in a single collision 
with another large body. However, the origins of Pluto, 
Charon and the smaller moons are as yet unknown, and 
are a matter of scientific conjecture. 

Since the early 1990s thousands of new objects have 
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been identified beyond the orbit of Neptune. The trans- 
Neptunian objects (Box 1.3) are mostly small, but at least 
one, Eris, is comparable in size to Pluto. The new discover¬ 
ies fueled discussion about Pluto’s status as a planet. In 
2006 the definition of what constitutes a planet was modi¬ 
fied. To be a planet an object must (1) be in orbit around a 
star (Sun), (2) be large enough so that its own gravitation 
results in a spherical or spheroidal shape, (3) not be so 
large as to initiate nuclear fusion, and (4) have cleared 
the neighborhood around its orbit of planetesimals. 
Conditions (1) and (3) are met by all objects orbiting the 
Sun. An object that meets conditions (1) and (2) and is not 
a satellite of another body is called a “dwarf planet.” Pluto 
falls in this new category, along with the asteroid Ceres and 
the scattered disk object Eris (Box 1.3). 

1.1.3.5 Angular momentum 

An important characteristic that constrains models of the 
origin of the solar system is the difference between the 
distributions of mass and angular momentum. To deter¬ 
mine the angular momentum of a rotating body it is nec¬ 
essary to know its moment of inertia. For a particle of 
mass in the moment of inertia (/) about an axis at distance 
r is defined as: 

I=mr 2 (1-4) 

The angular momentum (/?) is defined as the product of 
its moment of inertia (I) about an axis and its rate of rota¬ 
tion (11) about that axis: 

h=m (1.5) 

Each planet revolves in a nearly circular orbit around 
the Sun and at the same time rotates about its own axis. 
Thus there are two contributions to its angular momen¬ 
tum (Table 1.3). The angular momentum of a planet’s 
revolution about the Sun is obtained quite simply. The 
solar system is so immense that the physical size of each 
planet is tiny compared to the size of its orbit. The 
moment of inertia of a planet about the Sun is computed 
by inserting the mass of the planet and its orbital radius 
(Table 1.3) in Eq. (1.4); the orbital angular momentum of 
the planet follows by combining the computed moment of 
inertia with the rate of orbital revolution as in Eq. (1.5). 
To determine the moment of inertia of a solid body about 
an axis that passes through it (e.g., the rotational axis of a 
planet) is more complicated. Equation (1.4) must be com¬ 
puted and summed for all particles in the planet. If the 
planet is represented by a sphere of mass M and mean 
radius R, the moment of inertia C about the axis of rota¬ 
tion is given by 

C = kMR 1 (1.6) 

where the constant k is determined by the density distrib¬ 
ution within the planet. For example, if the density is 
uniform inside the sphere, the value of k is exactly 2/5, or 


0.4; for a hollow sphere it is 2/3. If density increases with 
depth in the planet, e.g., if it has a dense core, the value of 
k is less than 0.4; for the Earth, k = 0.3308. For some 
planets the variation of density with depth is not well 
known, but for most planets there is enough information 
to calculate the moment of inertia about the axis of rota¬ 
tion; combined with the rate of rotation as in Eq. (1.5), 
this gives the rotational angular momentum. 

The angular momentum of a planet’s revolution about 
the Sun is much greater (on average about 60,000 times) 
than the angular momentum of its rotation about its own 
axis (Table 1.3). Whereas more than 99.9% of the total 
mass of the solar system is concentrated in the Sun, more 
than 99% of the angular momentum is carried by the 
orbital motion of the planets, especially the four great 
planets. Of these Jupiter is a special case: it accounts for 
over 70% of the mass and more than 60% of the angular 
momentum of the planets. 

1.1.4 The origin of the solar system 

There have been numerous theories for the origin of the 
solar system. Age determinations on meteorites indicate 
that the solar system originated about (4.5-4.6)Xl0 9 
years ago. A successful theory of how it originated must 
account satisfactorily for the observed characteristics of 
the planets. The most important of these properties are 
the following. 

(1) Except for Pluto, the planetary orbits lie in or close to 
the same plane, which contains the Sun and the orbit 
of the Earth (the ecliptic plane). 

(2) The planets revolve about the Sun in the same sense, 
which is counterclockwise when viewed from above 
the ecliptic plane. This sense of rotation is defined as 
prograde. 

(3) The rotations of the planets about their own axes are 
also mostly prograde. The exceptions are Venus, 
which has a retrograde rotation; Uranus, whose axis 
of rotation lies nearly in the plane of its orbit; and 
Pluto, whose rotation axis and orbital plane are 
oblique to the ecliptic. 

(4) Each planet is roughly twice as far from the Sun as its 
closest neighbor (Bode’s law). 

(5) The compositions of the planets make up two dis¬ 
tinct groups: the terrestrial planets lying close to the 
Sun are small and have high densities, whereas the 
great planets far from the Sun are large and have low 
densities. 

(6) The Sun has almost 99.9% of the mass of the solar 
system, but the planets account for more than 99% of 
the angular momentum. 

The first theory based on scientific observation was the 
nebular hypothesis introduced by the German philosopher 
Immanuel Kant in 1755 and formulated by the French 
astronomer Pierre Simon de Laplace in 1796. According 
to this hypothesis the planets and their satellites were 
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Box 1.3: Trans-Neptunian objects 


A trans-Neptunian object (TNO) is any object in orbit 
around the Sun at a greater average distance than 
Neptune. They include Pluto and its moon Charon, as 
well as numerous other bodies. The objects are grouped 
in three classes according to the size of their orbit: the 
Kuiper belt, Scattered Disk, and Oort Cloud. Their 
composition is similar to that of comets, i.e., mainly ice, 
although some have densities high enough to suggest 
other rock-like components. 

The Kuiper belt extends beyond the mean radius of 
Neptune’s orbit at 30 AU to a distance of about 50 AU 
(Fig. B1.3). This disk-shaped region, close to the ecliptic 
plane, contains thousands of objects in orbit around the 
Sun. According to some estimates there are more than 
35,000 Kuiper belt objects larger than 100 km in diame¬ 
ter, so they are much larger and more numerous than the 
asteroids. Some have orbital periods that are in resonance 
with the orbit of Neptune, and this has given rise to some 
curious appellations for them. Objects like Pluto with 
orbital periods in 3:2 resonance with Neptune are called 
plutinos, those further out in the belt with periods in 2:1 
resonance are called twotinos, and objects in intermedi¬ 
ate orbits are called cubewanos. The Kuiper belt objects 
are all largely icy in composition, and some of them are 
quite large. For example, Quaoar, in an orbit with semi¬ 
major axis 43.5 AU, has a diameter of 1260 km and so is 
about the same size as Pluto’s moon, Charon. 

Objects in orbit at mean distances greater than 50 AU 
are called scattered disk objects. A large trans-Neptunian 
object, 2003UB 313 , was identified in 2003 and confirmed 
in 2005 during a long-term search for distant moving 
objects in the solar system. On the basis of its reflectivity 
this object is larger than Pluto, and was at first consid¬ 
ered to be the tenth planet in the solar system. It has an 
orbital period of 557 yr, a highly elliptical orbit inclined 
at 44° to the ecliptic, and is currently near to aphelion. 
Its present heliocentric distance of 97 AU makes it the 
most distant known object in the solar system. Now 
named Eris, it is classified together with Pluto and the 
asteroid Ceres in the new category of “dwarf planet .” 

In 2004 another trans-Neptunian object, Sedna, was 
discovered at a distance of 90 AU (Fig. B1.3). It is 
presently closer to the Sun than Eris, but its extremely 
elliptical orbit (eccentricity 0.855, inclination 12°) takes 
Sedna further into the outer reaches of the solar system 
than any known object. Its orbital period is 12,500 yrs 
and its aphelion lies at about 975 AU. The object is 
visible to astronomers only as a tiny speck so apart from 
its orbit not much is known about it. It is considered to 
be the only known object that may have originated in 
the Oort cloud. 
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Fig. B1.3 The relative sizes of the Oort cloud and Kuiper belt in 
relation to the orbits of the outer planets. The inner planets and Sun 
are contained within the innermost circle of the lower part of the 
figure (courtesy NASA/JPL-Caltech). 


The Oort cloud is thought to be the source of most 
new comets that enter the inner solar system. It is visual¬ 
ized as a spherical cloud of icy objects at an enormous 
distance - between 50,000 and 100,000 AU (roughly one 
light year) - from the Sun. The Oort cloud has never 
been observed, but its existence has been confirmed 
from work on cometary orbits. It plays a central role in 
models of the origin of comets. 
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formed at the same time as the Sun. Space was filled by a 
rotating cloud ( nebula ) of hot primordial gas and dust 
that, as it cooled, began to contract. To conserve the 
angular momentum of the system, its rotation speeded up; 
a familiar analogy is the way a pirouetting skater spins 
more rapidly when he draws in his outstretched arms. 
Centrifugal force would have caused concentric rings of 
matter to be thrown off, which then condensed into 
planets. A serious objection to this hypothesis is that the 
mass of material in each ring would be too small to 
provide the gravitational attraction needed to cause the 
ring to condense into a planet. Moreover, as the nebula 
contracted, the largest part of the angular momentum 
would remain associated with the main mass that con¬ 
densed to form the Sun, which disagrees with the observed 
distribution of angular momentum in the solar system. 

Several alternative models were postulated subse¬ 
quently, but have also fallen into disfavor. For example, 
the collision hypothesis assumed that the Sun was formed 
before the planets. The gravitational attraction of a 
closely passing star or the blast of a nearby supernova 
explosion drew out a filament of solar material that con¬ 
densed to form the planets. However, a major objection to 
this scenario is that the solar material would have been so 
hot that it would dissipate explosively into space rather 
than condense slowly to form the planets. 

Modern interpretations of the origin of the solar 
system are based on modifications of the nebular 
hypothesis. As the cloud of gas and dust contracted, its 
rate of rotation speeded up, flattening the cloud into a 
lens-shaped disk. When the core of the contracting cloud 
became dense enough, gravitation caused it to collapse 
upon itself to form a proto-Sun in which thermonuclear 
fusion was initiated. Hydrogen nuclei combined under 
the intense pressure to form helium nuclei, releasing huge 
amounts of energy. The material in the spinning disk was 
initially very hot and gaseous but, as it cooled, solid 
material condensed out of it as small grains. The grains 
coalesced as rocky or icy clumps called planetesimals. 
Asteroid-like planetesimals with a silicate, or rocky, com¬ 
position formed near the Sun, while comet-like planetesi¬ 
mals with an icy composition formed far from the Sun’s 
heat. In turn, helped by gravitational attraction, the plan¬ 
etesimals accreted to form the planets. Matter with a high 
boiling point (e.g., metals and silicates) could condense 
near to the Sun, forming the terrestrial planets. Volatile 
materials (e.g., water, methane) would vaporize and be 
driven into space by the stream of particles and radiation 
from the Sun. During the condensation of the large 
cold planets in the frigid distant realms of the solar 
system, the volatile materials were retained. The gravita¬ 
tional attractions of Jupiter and Saturn may have been 
strong enough to retain the composition of the original 
nebula. 

It is important to keep in mind that this scenario is 
merely a hypothesis - a plausible but not unique explana¬ 
tion of how the solar system formed. It attributes the 


variable compositions of the planets to accretion at 
different distances from the Sun. The model can be 
embellished in many details to account for the character¬ 
istics of individual planets. However, the scenario is 
unsatisfactory because it is mostly qualitative. For 
example, it does not adequately explain the division of 
angular momentum. Physicists, astronomers, space scien¬ 
tists and mathematicians are constantly trying new 
methods of investigation and searching for additional 
clues that will improve the hypothesis of how the solar 
system formed. 

1.2 THE DYNAMIC EARTH 

1.2.1 Historical introduction 

The Earth is a dynamic planet, perpetually changing both 
externally and internally. Its surface is constantly being 
altered by endogenic processes (i.e., of internal origin) 
resulting in volcanism and tectonism, as well as by exo¬ 
genic processes (i.e., of external origin) such as erosion 
and deposition. These processes have been active 
throughout geological history. Volcanic explosions like 
the 1980 eruption of Mt. St. Helens in the northwestern 
United States can transform the surrounding landscape 
virtually instantaneously. Earthquakes also cause sudden 
changes in the landscape, sometimes producing faults 
with displacements of several meters in seconds. Weather- 
related erosion of surface features occasionally occurs at 
dramatic rates, especially if rivers overflow or landslides 
are triggered. The Earth's surface is also being changed 
constantly by less spectacular geological processes at 
rates that are extremely slow in human terms. Regions 
that have been depressed by the loads of past ice-sheets 
are still rebounding vertically at rates of up to several 
mmyr -1 . Tectonic forces cause mountains to rise at 
similar uplift rates, while the long-term average effects of 
erosion on a regional scale occur at rates of cm yr -1 . On a 
larger scale the continents move relative to each other at 
speeds of up to several cm yr -1 for time intervals lasting 
millions of years. Extremely long times are represented in 
geological processes. This is reflected in the derivation of 
a geological timescale (Section 4.1.1.3). The subdivisions 
used below are identified in Fig. 4.2. 

The Earth's interior is also in motion. The mantle 
appears hard and solid to seismic waves, but is believed to 
exhibit a softer, plastic behavior over long geological time 
intervals, flowing (or ‘’creeping”) at rates of several 
cm yr -1 . Deeper inside the Earth, the liquid core probably 
flows at a geologically rapid rate of a few tenths of a mil¬ 
limeter per second. 

Geologists have long been aware of the Earth’s dynamic 
condition. Several hypotheses have attempted to explain 
the underlying mechanisms. In the late nineteenth and 
early twentieth centuries geological orthodoxy favored the 
hypothesis of a contracting Earth. Mountain ranges were 
thought to have formed on its shrinking surface like 
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wrinkles on a desiccating apple. Horizontal tectonic dis¬ 
placements were known, but were considered to be a 
by-product of more important vertical motions. The real¬ 
ization that large overthrusts played an important role in 
the formation of nappe structures in the Alps implied 
amounts of horizontal shortening that were difficult to 
accommodate in the contraction hypothesis. A new school 
of thought emerged in which mountain-building was 
depicted as a consequence of horizontal displacements. 

A key observation in this context was the congruity 
between the opposing coasts of the South Atlantic, espe¬ 
cially the similar shapes of the coastlines of Brazil and 
Africa. As early as 1620, despite the inaccuracy and 
incompleteness of early seventeenth century maps, 
Francis Bacon drew attention to the parallelism of the 
Atlantic-bordering coastlines. In 1858 Antonio Snider 
constructed a map showing relative movements of the 
circum-Atlantic continents, although he did not maintain 
the shapes of the coastlines. In the late nineteenth century 
the Austrian geologist Eduard Suess coined the name 
Gondwanaland for a proposed great southern continent 
that existed during late Paleozoic times. It embodied 
Africa, Antarctica, Arabia, Australia, India and South 
America, and lay predominantly in the southern hemi¬ 
sphere. The Gondwana continents are now individual 
entities and some (e.g., India, Arabia) no longer lie in the 
southern hemisphere, but they are often still called the 
“southern continents." In the Paleozoic, the “northern 
continents" of North America (including Greenland), 
Europe and most of Asia also formed a single continent, 
called Laurasia. Laurasia and Gondwanaland split apart 
in the Early Mesozoic. The Alpine-Himalayan mountain 
belt was thought to have developed from a system of geo¬ 
synclines that formed in the intervening sea, which Suess 
called the Tethys ocean to distinguish it from the present 
Mediterranean Sea. Implicit in these reconstructions is 
the idea that the continents subsequently reached their 
present positions by slow horizontal displacements across 
the surface of the globe. 

1.2.2 Continental drift 

The “displacement hypothesis” of continental movements 
matured early in the twentieth century. In 1908 F. B. Taylor 
related the world’s major fold-belts to convergence of the 
continents as they moved away from the poles, and in 1911 
H. B. Baker reassembled the Atlantic-bordering continents 
together with Australia and Antarctica into a single conti¬ 
nent; regrettably he omitted Asia and the Pacific. However, 
the most vigorous proponent of the displacement hypothe¬ 
sis was Alfred Wegener, a German meteorologist and geol¬ 
ogist. In 1912 Wegener suggested that all of the continents 
were together in the Late Paleozoic, so that the land area of 
the Earth formed a single landmass (Fig. 1.5). He coined 
the name Pangaea (Greek for “all Earth”) for this super¬ 
continent, which he envisioned was surrounded by a single 
ocean ( Panthalassa ). Wegener referred to the large-scale 



Fig. 1.5 (a) Wegener's reconstruction of Pangaea in the Late 
Carboniferous, showing estimated positions of the North and South 
poles and paleo-equator. Shaded areas, arid regions; K, coal deposits; S, 
salt deposits; W, desert regions; E, ice sheets (modified after Koppen 
and Wegener, 1924). Relative positions of the continents are shown in 
(b) the Eocene (shaded areas, shallow seas) and (c) the Early Quaternary 
(after Wegener, 1922). The latitudes and longitudes are arbitrary. 

horizontal displacement of crustal blocks having continen¬ 
tal dimensions as Kontinentalverschiebung. The anglicized 
form, continental drift, implies additionally that displace¬ 
ments of the blocks take place slowly over long time inter¬ 
vals. 

1.2.2.1 Pangaea 

As a meteorologist Wegener was especially interested in 
paleoclimatology. For the first half of the twentieth century 
the best evidence for the continental drift hypothesis and the 
earlier existence of Pangaea consisted of geological indica¬ 
tors of earlier paleoclimates. In particular, Wegener 
observed a much better alignment of regions of Permo- 
Carboniferous glaciation in the southern hemisphere when 
the continents were in the reconstructed positions for 
Gondwanaland instead of their present positions. His 
reconstruction of Pangaea brought Carboniferous coal 
deposits into alignment and suggested that the positions of 
the continents relative to the Paleozoic equator were quite 
different from their modern ones. Together with W. Koppen, 
a fellow German meteorologist, he assembled paleoclimatic 
data that showed the distributions of coal deposits (evi¬ 
dence of moist temperate zones), salt, gypsum and desert 
sandstones (evidence of dry climate) for several geological 
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eras (Carboniferous, Permian, Eocene, Quaternary). When 
plotted on Wegener’s reconstruction maps, the paleocli- 
matic data for each era formed climatic belts just like today; 
namely, an equatorial tropical rain belt, two adjacent dry 
belts, two temperate rain belts, and two polar ice caps (Fig. 
1.5a). 

Wegener's continental drift hypothesis was bolstered in 
1937 by the studies of a South African geologist, 
Alexander du Toit, who noted sedimentological, paleon¬ 
tological, paleoclimatic, and tectonic similarities between 
western Africa and eastern South America. These favored 
the Gondwanaland reconstruction rather than the present 
configuration of continents during the Late Paleozoic and 
Early Mesozoic. 

Some of Wegener’s theories were largely conjectural. On 
the one hand, he reasoned correctly that the ocean basins 
are not permanent. Yet he envisioned the sub-crustal mater¬ 
ial as capable of viscous yield over long periods of time, 
enabling the continents to drift through the ocean crust like 
ships through water. This model met with profound scepti¬ 
cism among geologists. He believed, in the face of strong 
opposition from physicists, that the Earth’s geographic axis 
had moved with time, instead of the crust moving relative to 
the fixed poles. His timing of the opening of the Atlantic 
(Fig. 1.5b, c) was faulty, requiring a large part of the separa¬ 
tion of South America from Africa to take place since the 
Early Pleistocene (i.e., in the last two million years or so). 
Moreover, he was unable to offer a satisfactory driving 
mechanism for continental drift. His detractors used the 
disprovable speculations to discredit his better-documented 
arguments in favor of continental drift. 

1.2.2.2 Computer-assisted reconstructions 

Wegener pointed out that it was not possible to fit the 
continents together using their present coastlines, which 
are influenced by recent sedimentary deposits at the 
mouths of major rivers as well as the effects of coastal 
erosion. The large areas of continental shelf must also be 
taken into account, so Wegener matched the continents at 
about the edges of the continental shelves, where the con¬ 
tinental slopes plunge into the oceanic basins. The match¬ 
ing was visual and inexact by modern standards, but more 
precise methods only became available in the 1960s with 
the development of powerful computers. 

In 1965 E. C. Bullard, J. E. Everett and A. G. Smith 
used a computer to match the relative positions of the con¬ 
tinents bounding the Atlantic ocean (Fig. 1.6). They digi¬ 
tized the continental outlines at approximately 50 km 
intervals for different depth contours on the continental 
slopes, and selected the fit of the 500 fathom (900 m) depth 
contour as optimum. The traces of opposite continental 
margins were matched by an iterative procedure. One trace 
was rotated relative to the other (about a pole of relative 
rotation) until the differences between the traces were min¬ 
imized; the procedure was then repeated with different 
rotation poles until the best fit was obtained. The optimum 



Fig. 1.6 Computer-assisted fit of the Atlantic-bordering continents at 
the 500 fathom (900 m) depth (after Bullard eta!., 1965). 

fit is not perfect, but has some overlaps and gaps. 
Nevertheless, the analysis gives an excellent geometric fit of 
the opposing coastlines of the Atlantic. 

A few years later A. G. Smith and A. Hallam used the 
same computer-assisted technique to match the coastlines 
of the southern continents, also at the 500 fathom depth 
contour (Fig. 1.7). They obtained an optimum geometric 
reconstruction of Gondwanaland similar to the visual 
match suggested by du Toit in 1937; it probably represents 
the geometry of Gondwanaland that existed in the Late 
Paleozoic and Early Mesozoic. It is not the only possible 
good geometric fit, but it also satisfies other geological 
evidence. At various times in the Jurassic and Cretaceous, 
extensional plate margins formed within Gondwanaland, 
causing it to subdivide to form the present “southern con¬ 
tinents.” The dispersal to their present positions took 
place largely in the Late Cretaceous and Tertiary. 

Pangaea existed only in the Late Paleozoic and Early 
Mesozoic. Geological and geophysical evidence argues in 
favor of the existence of its northern and southern con¬ 
stituents - Laurasia and Gondwanaland - as separate enti¬ 
ties in the Early Paleozoic and Precambrian. An important 
source of data bearing on continental reconstructions in 
ancient times and the drift of the continents is provided by 
paleomagnetism, which is the record of the Earth’s ancient 
magnetic field. Paleomagnetism is described in Section 5.6 
and summarized below. 
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Fig. 1.7 Computer-assisted 
fit of the continents that 
formed Gondwanaland (after 
Smith and Hallam, 1970). 



1.2.2.3 Paleomagnetism and continental drift 

In the late nineteenth century geologists discovered that 
rocks can carry a stable record of the geomagnetic field 
direction at the time of their formation. From the magneti¬ 
zation direction it is possible to calculate the position of 
the magnetic pole at that time; this is called the virtual geo¬ 
magnetic pole (VGP) position. Averaged over a time inter¬ 
val longer than a few tens of thousands of years, the mean 
VGP position coincides with the geographic pole, as if the 
axis of the mean geomagnetic dipole field were aligned 
with the Earth’s rotation axis. This correspondence can be 
proved for the present geomagnetic field, and a fundamen¬ 
tal assumption of paleomagnetism - called the “axial 
dipole hypothesis” - is that it has always been valid. The 
hypothesis can be verified for rocks and sediments up to a 
few million years old, but its validity has to be assumed for 
earlier geological epochs. However, the self-consistency of 
paleomagnetic data and their compatibility with continen¬ 
tal reconstructions argue that the axial dipole hypothesis is 
also applicable to the Earth’s ancient magnetic field. 

For a particular continent, rocks of different ages give 
different mean VGP positions. The appearance that the 
pole has shifted with time is called apparent polar wander 
(APW). By connecting mean VGP positions of different 
ages for sites on the same continent a line is obtained, 
called the apparent polar wander path of the continent. 
Each continent yields a different APW path, which conse¬ 
quently cannot be the record of movement of the pole. 
Rather, each APW path represents the movement of the 
continent relative to the pole. By comparing APW paths 
the movements of the continents relative to each other 
can be reconstructed. The APW paths provide strong 
supporting evidence for continental drift. 

Paleomagnetism developed as a geological discipline in 
the 1950s and 1960s. The first results indicating large-scale 


continental movement were greeted with some scepticism. 
In 1956 S. K. Runcorn demonstrated that the paleomag¬ 
netic data from Permian and Triassic rocks in North 
America and Great Britain agreed better if the Atlantic 
ocean were closed, i.e., as in the Laurasia configuration. In 
1957 E. Irving showed that Mesozoic paleomagnetic data 
from the “southern continents” were more concordant 
with du Toit’s Gondwanaland reconstruction than with 
the present arrangement of the continents. Since these 
pioneering studies numerous paleomagnetic investiga¬ 
tions have established APW paths for the different conti¬ 
nents. The quality of the paleomagnetic record is good for 
most geological epochs since the Devonian. 

The record for older geological periods is less reliable 
for several reasons. In the Early Paleozoic the data 
become fewer and the APW paths become less well 
defined. In addition, the oldest parts of the paleomag¬ 
netic record are clouded by the increasing possibility of 
false directions due to undetected secondary magnetiza¬ 
tion. This happens when thermal or tectonic events alter 
the original magnetization, so that its direction no longer 
corresponds to that at the time of rock formation. 
Remagnetization can affect rocks of any age, but it is rec¬ 
ognized more readily and constitutes a less serious 
problem in younger rocks. 

Problems afflicting Precambrian paleomagnetism are 
even more serious than in the Early Paleozoic. APW paths 
have been derived for the Precambrian, especially for 
North America, but only in broad outline. In part this is 
because it is difficult to date Precambrian rocks precisely 
enough to determine the fine details of an APW path. It is 
often not possible to establish which is the north or south 
pole. In addition, the range of time encompassed by the 
Precambrian - more than 3.5 Ga - is about six times longer 
than the 570 Ma length of the Phanerozoic, and the proba¬ 
bility of remagnetization events is correspondingly higher. 
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Fig. 1.8 Paleomagnetic 
reconstruction of the relative 
positions of (a) Laurentia 
(North America and 
Greenland), Baltica and 
Gondwanaland (South 
America, Africa, Arabia, 
Australia, India and 
Antarctica) in the Late 
Ordovician and (b) Laurussia 
(North America and Baltica) 
and Gondwanaland in the 
Middle Silurian (after Van der 
Voo, 1993). 




In spite of some uncertainties, Early Paleozoic paleo- 
magnetism permits reassembly of the supercontinents 
Gondwanaland and Laurasia and traces their movements 
before they collided in the Carboniferous to form 
Pangaea. Geological and paleomagnetic evidence concur 
that, in the Cambrian period, Gondwanaland very likely 
existed as a supercontinent in essentially the du Toit con¬ 
figuration. It coexisted in the Early Paleozoic with three 
other cratonic centers: Laurentia (North America and 
Greenland), Baltica (northern Europe) and Siberia. 
Laurentia and Baltica were separated by the Iapetus 
ocean (Fig. 1.8a), which began to close in the Ordovician 
(about 450 Ma ago). Paleomagnetic data indicate that 
Laurentia and Baltica fused together around Late 
Silurian time to form the supercontinent Laurussia; at 
that time the Siberian block remained a separate entity. 
The Laurentia-Baltica collision is expressed in the 
Taconic and Caledonian orogenies in North America and 
northern Europe. The gap between Gondwanaland and 
Laurussia in the Middle Silurian (Fig. 1.8b) closed about 
the time of the Silurian-Devonian boundary (about 
410 Ma ago). Readjustments of the positions of the conti¬ 
nental blocks in the Devonian produced the Acadian 
orogeny. Laurussia separated from Gondwanaland in the 
Late Devonian, but the two supercontinents began to 
collide again in the Early Carboniferous (about 350 Ma 
ago), causing the Hercynian orogeny. By the Late 
Carboniferous (300Ma ago) Pangaea was almost com¬ 
plete, except for Siberia, which was probably appended in 
the Permian. 


The general configuration of Pangaea from the Late 
Carboniferous to the Early Jurassic is supported by 
paleomagnetic results from the Atlantic-bordering 
continents. However, the paleomagnetic data suggest that 
the purely geometric “Bullard-fit” is only appropriate for 
the later part of Pangaea's existence. The results for earlier 
times from the individual continents agree better for 
slightly different reconstructions (see Section 5.6.4.4). 
This suggests that some internal rearrangement of the 
component parts of Pangaea may have occurred. Also, the 
computer-assisted geometric assembly of Gondwanaland, 
similar to that proposed by du Toit, is not the only 
possible reconstruction, although paleomagnetic results 
confirm that it is probably the optimum one. Other 
models involve different relative placements of West 
Gondwanaland (i.e., South America and Africa) and East 
Gondwanaland (i.e., Antarctica, Australia and India), and 
imply that they may have moved relative to each other. 
The paleomagnetic data do not contradict the alternative 
models, but are not precise enough to discriminate defini¬ 
tively between them. 

The consistency of paleomagnetic results leaves little 
room for doubt that the continents have changed position 
relative to each other throughout geological time. This 
lends justification to the concept of continental drift, but 
it does not account for the mechanism by which it has 
taken place. Another aspect of the paleomagnetic record 
- the history of magnetic field polarity rather than the 
APW paths - has played a key role in deducing the mech¬ 
anism. The explanation requires an understanding of the 
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Fig. 1.9 Simplified layered 
structure of the Earth's 
interior showing the depths of 
the most important seismic 
discontinuities. 



Earth’s internal structure, the distribution of seismicity 
and the importance of the ocean basins. 

1.2.3 Earth structure 

Early in the twentieth century it became evident from the 
study of seismic waves that the interior of the Earth has a 
radially layered structure, like that of an onion ( Fig. 1.9). 
The boundaries between the layers are marked by abrupt 
changes in seismic velocity or velocity gradient. Each 
layer is characterized by a specific set of physical proper¬ 
ties determined by the composition, pressure and temper¬ 
ature in the layer. The four main layers are the crust, 
mantle and the outer and inner cores. Their properties are 
described in detail in Section 3.7 and summarized briefly 
here. 

At depths of a few tens of kilometers under continents 
and less than ten kilometers beneath the oceans seismic 
velocities increase sharply. This seismic discontinuity, dis¬ 
covered in 1909 by A. Mohorovigig, represents the 
boundary between the crust and mantle. R. D. Oldham 
noted in 1906 that the travel-times of seismic compres- 
sional waves that traversed the body of the Earth were 
greater than expected; the delay was attributed to a fluid 
outer core. Support for this idea came in 1914, when 
B. Gutenberg described a shadow zone for seismic waves 
at epicentral distances greater than about 105°. Just as 
light waves cast a shadow of an opaque object, seismic 
waves from an earthquake cast a shadow of the core on 


the opposite side of the world. Compressional waves can 
in fact pass through the liquid core. They appear, delayed 
in time, at epicentral distances larger than 143°. In 1936 
I. Lehmann observed the weak arrivals of compressional 
waves in the gap between 105° and 143°. They are inter¬ 
preted as evidence for a solid inner core. 

1.2.3.1 Lithospheric plates 

The radially layered model of the Earth’s interior 
assumes spherical symmetry. This is not valid for the crust 
and upper mantle. These outer layers of the Earth show 
important lateral variations. The crust and uppermost 
mantle down to a depth of about 70-100 km under deep 
ocean basins and 100-150 km under continents are rigid, 
forming a hard outer shell called the lithosphere. Beneath 
the lithosphere lies the asthenosphere, a layer in which 
seismic velocities often decrease, suggesting lower rigidity. 
It is about 150 km thick, although its upper and lower 
boundaries are not sharply defined. This weaker layer is 
thought to be partially molten; it may be able to flow over 
long periods of time like a viscous liquid or plastic solid, 
in a way that depends on temperature and composition. 
The asthenosphere plays an important role in plate tec¬ 
tonics, because it makes possible the relative motions of 
the overlying lithospheric plates. 

The brittle condition of the lithosphere causes it to frac¬ 
ture when strongly stressed. The rupture produces an 
earthquake, which is the violent release of elastic energy 
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Fig. 1.10 The geographical 
distribution of epicenters for 
30,000 earthquakes for the 
years 1961-1967 illustrates 
the tectonically active regions 
of the Earth (after Barazangi 
and Dorman, 1969). 



Fig. 1.11 The major and 
minor lithospheric plates. The 
arrows indicate relative 
velocities in mm yr -1 at active 
plate margins, as deduced 
from the model NUVEL-1 of 
current plate motions (data 
source: DeMets eta!., 1990). 
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due to sudden displacement on a fault plane. Earthquakes 
are not distributed evenly over the surface of the globe, but 
occur predominantly in well-defined narrow seismic zones 
that are often associated with volcanic activity (Fig. 1.10). 
These are: (a) the circum-Pacific “ring of fire”; (b) a 
sinuous belt running from the Azores through North 
Africa and the Alpine-Dinaride-Himalayan mountain 
chain as far as S.E. Asia; and (c) the world-circling system 
of oceanic ridges and rises. The seismic zones subdivide the 
lithosphere laterally into tectonic plates (Fig. 1.11). A plate 
may be as broad as 10,000 km (e.g., the Pacific plate) or as 
small as a few 1000 km (e.g., the Philippines plate). There 
are twelve major plates (Antarctica, Africa, Eurasia, India, 
Australia, Arabia, Philippines, North America, South 
America, Pacific, Nazca, and Cocos) and several minor 
plates (e.g., Scotia, Caribbean, Juan de Fuca). The posi¬ 
tions of the boundaries between the North American 
and South American plates and between the North 
American and Eurasian plates are uncertain. The bound¬ 


ary between the Indian and Australian plates is not sharply 
defined, but may be a broad region of diffuse deformation. 

A comprehensive model of current plate motions 
(called NUVEL-1), based on magnetic anomaly patterns 
and first-motion directions in earthquakes, shows rates of 
separation at plate boundaries that range from about 
20 mm yr -1 in the North Atlantic to about 160 mm yr -1 
on the East Pacific Rise (Fig. 1.11). The model also gives 
rates of closure ranging from about lOmni yr -1 between 
Africa and Eurasia to about 80 mm yr -1 between the 
Nazca plate and South America. 

1.2.4 Types of plate margin 

An important factor in the evolution of modern plate tec¬ 
tonic theory was the development of oceanography in the 
years following World War II, when technology designed 
for warfare was turned to peaceful purposes. The bathyme¬ 
try of the oceans was charted extensively by echo-sounding 
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and within a few years several striking features became 
evident. Deep trenches, more than twice the depth of the 
ocean basins, were discovered close to island arcs and some 
continental margins; the Marianas Trench is more than 
11 km deep. A prominent submarine mountain chain - 
called an oceanic ridge - was found in each ocean. The 
oceanic ridges rise to as much as 3000 m above the adjacent 
basins and form a continuous system, more than 
60,000 km in length, that girdles the globe. Unlike conti¬ 
nental mountain belts, which are usually less than several 
hundred kilometers across, the oceanic ridges are 
2000-4000 km in width. The ridge system is offset at inter¬ 
vals by long horizontal faults forming fracture zones. 
These three features - trenches, ridges and fracture zones - 
originate from different plate tectonic processes. 

The lithospheric plates are very thin in comparison to 
their breadth (compare Fig. 1.9 and Fig. 1.11). Most 
earthquakes occur at plate margins, and are associated 
with interactions between plates. Apart from rare 
intraplate earthquakes, which can be as large and disas¬ 
trous as the earthquakes at plate boundaries, the plate 
interiors are aseismic. This suggests that the plates behave 
rigidly. Analysis of earthquakes allows the direction of 
displacement to be determined and permits interpreta¬ 
tion of the relative motions between plates. 

There are three types of plate margin, distinguished by 
different tectonic processes (Fig. 1.12). The world-wide 
pattern of earthquakes shows that the plates are presently 
moving apart at oceanic ridges. Magnetic evidence, dis¬ 
cussed below, confirms that the separation has been going 
on for millions of years. New lithosphere is being formed 
at these spreading centers, so the ridges can be regarded as 
constructive plate margins. The seismic zones related to 
deep-sea trenches, island arcs and mountain belts mark 
places where lithospheric plates are converging. One plate 
is forced under another there in a so-called subduction 
zone. Because it is thin in relation to its breadth, the lower 
plate bends sharply before descending to depths of 
several hundred kilometers, where it is absorbed. The sub¬ 
duction zone marks a destructive plate margin. 

Constructive and destructive plate margins may consist 
of many segments linked by horizontal faults. A crucial 
step in the development of plate tectonic theory was made 
in 1965 by a Canadian geologist, J. Tuzo Wilson, who rec¬ 
ognized that these faults are not conventional transcurrent 
faults. They belong to a new class of faults, which Wilson 
called transform faults. The relative motion on a transform 
fault is opposite to what might be inferred from the off¬ 
sets of bordering ridge segments. At the point where a 
transform fault meets an oceanic ridge it transforms the 
spreading on the ridge to horizontal shear on the fault. 
Likewise, where such a fault meets a destructive plate 
margin it transforms subduction to horizontal shear. 

The transform faults form a conservative plate margin, 
where lithosphere is neither created nor destroyed; the 
boundary separates plates that move past each other hor¬ 
izontally. This interpretation was documented in 1967 by 



Fig. 1.12 Schematic model illustrating the three types of plate margin. 
Lightly hachured areas symbolize spreading ridges (constructive 
margins); darker shaded areas denote subduction zones (destructive 
margins); dark lines mark transform faults (conservative margins). The 
figure is drawn relative to the pole of relative motion between plates A 
and B. Small arrows denote relative motion on transform faults; large 
arrows show directions of plate motion, which can be oblique to the 
strike of ridge segments or subduction zones. Arrows in the 
asthenosphere suggest return flow from destructive to constructive 
margins. 

L. Sykes, an American seismologist. Ffe showed that 
earthquake activity on an oceanic ridge system was con¬ 
fined almost entirely to the transform fault between ridge 
crests, where the neighboring plates rub past each other. 
Most importantly, Sykes found that the mechanisms of 
earthquakes on the transform faults agreed with the pre¬ 
dicted sense of strike-slip motion. 

Transform faults play a key role in determining plate 
motions. Spreading and subduction are often assumed to 
be perpendicular to the strike of a ridge or trench, as is 
the case for ridge X in Fig. 1.12. This is not necessarily the 
case. Oblique motion with a component along strike is 
possible at each of these margins, as on ridge Y. Flowever, 
because lithosphere is neither created nor destroyed at a 
conservative margin, the relative motion between adja¬ 
cent plates must be parallel to the strike of a shared trans¬ 
form fault. Pioneering independent studies by D. P. 
McKenzie and R. L. Parker (1967) and W. J. Morgan 
(1968) showed how transform faults could be used to 
locate the Euler pole of rotation for two plates (see 
Section 1.2.9). Using this method, X. Le Pichon in 1968 
determined the present relative motions of the major tec¬ 
tonic plates. In addition, he derived the history of plate 
motions in the geological past by incorporating newly 
available magnetic results from the ocean basins. 

1.2.5 Sea-floor spreading 

One of the principal stumbling blocks of continental drift 
was the inability to explain the mechanism by which drift 
took place. Wegener had invoked forces related to gravity 
and the Earth’s rotation, which were demonstrably much 
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Fig. 1.13 Symmetric striped 
pattern of magnetic 
anomalies on the Reykjanes 
segment of the Mid-Atlantic 
Ridge southwest of Iceland. 
The positive anomalies are 
shaded according to their 
age, as indicated in the 
vertical column (after Heirtzler 
etal., 1966). 




too weak to drive the continents through the resistant 
basaltic crust. A. Holmes proposed a model in 1944 that 
closely resembles the accepted plate tectonic model 
(Holmes, 1965). He noted that it would be necessary to 
remove basaltic rocks continuously out of the path of an 
advancing continent, and suggested that this took place at 
the ocean deeps where heavy eclogite “roots” would sink 
into the mantle and melt. Convection currents in the 
upper mantle would return the basaltic magma to the con¬ 
tinents as plateau basalts, and to the oceans through innu¬ 
merable fissures. Holmes saw generation of new oceanic 
crust as a process that was dispersed throughout an ocean 
basin. At the time of his proposal the existence of the 
system of oceanic ridges and rises was not yet known. 

The important role of oceanic ridges was first recog¬ 
nized by H. Hess in 1962. He suggested that new oceanic 
crust is generated from upwelling hot mantle material at 
the ridges. Convection currents in the upper mantle 
would rise to the surface at the ridges and then spread out 
laterally. The continents would ride on the spreading 
mantle material, carried along passively by the convec¬ 
tion currents. In 1961 R. Dietz coined the expression 
“sea-floor spreading” for the ridge process. This results in 
the generation of lineated marine magnetic anomalies at 
the ridges, which record the history of geomagnetic 
polarity reversals. Study of these magnetic effects led to 
the verification of sea-floor spreading. 

1.2.5.1 The Vine-Matthews-Morleyhypothesis 

Paleomagnetic studies in the late 1950s and early 1960s of 
radiometrically dated continental lavas showed that the 
geomagnetic field has changed polarity at irregular time 


intervals. For tens of thousands to millions of years the 
polarity might be normal (as at present), then unaccount¬ 
ably the poles reverse within a few thousand years, so that 
the north magnetic pole is near the south geographic pole 
and the south magnetic pole is near the north geographic 
pole. This state may again persist for a long interval, 
before the polarity again switches. The ages of the rever¬ 
sals in the last 5 million years have been obtained radio¬ 
metrically, giving an irregular but dated polarity sequence. 

A magnetic anomaly is a departure from the theoretical 
magnetic field at a given location. If the field is stronger 
than expected, the anomaly is positive; if it is weaker than 
expected, the anomaly is negative. In the late 1950s mag¬ 
netic surveys over the oceans revealed remarkable striped 
patterns of alternately positive and negative magnetic 
anomalies over large areas of oceanic crust (Fig. 1.13), for 
which conventional methods of interpretation gave no sat¬ 
isfactory account. In 1963 the English geophysicists F. J. 
Vine and D. H. Matthews and, independently, the 
Canadian geologist L. W. Morley, formulated a landmark 
hypothesis that explains the origin of the oceanic mag¬ 
netic anomaly patterns (see also Section 5.7.3). 

Observations on dredged samples had shown that 
basalts in the uppermost oceanic crust carry a strong rema¬ 
nent magnetization (i.e., they are permanently magnetized, 
like a magnet). The Vine-Matthews-Morley hypothesis 
integrates this result with the newly acquired knowledge of 
geomagnetic polarity reversals and the Hess-Dietz 
concept of sea-floor spreading (Fig. 1.14). The basaltic 
lava is extruded in a molten state. When it solidifies and its 
temperature cools below the Curie temperature of its mag¬ 
netic minerals, the basalt becomes strongly magnetized in 
the direction of the Earth's magnetic field at that time. 
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Fig. 1.14 Upper, observed 
and computed marine 
magnetic anomalies, in 
nanotesla (nT), across the 
Pacific-Antarctica ridge, and 
(lower) their interpreted origin 
in terms of the 
Vine-Matthews-Morley 
hypothesis (after Pitman and 
Heirtzler, 1966). 




Along an active spreading ridge, long thin strips of magne¬ 
tized basaltic crust form symmetrically on opposite sides 
of the spreading center, each carrying the magnetic imprint 
of the field in which it formed. Sea-floor spreading can 
persist for many millions of years at an oceanic ridge. 
During this time the magnetic field changes polarity many 
times, forming strips of oceanic crust that are magnetized 
alternately parallel and opposite to the present field, giving 
the observed patterns of positive and negative anomalies. 
Thus, the basaltic layer acts like a magnetic tape recorder, 
preserving a record of the changing geomagnetic field 
polarity. 

1.25.2 Rates of sea-floor spreading 

The width of a magnetic lineation (or stripe) depends on 
two factors: the speed with which the oceanic crust 
moves away from a spreading center, and the length of 
time that geomagnetic polarity is constantly normal or 
reversed. The distance between the edges of magnetized 
crustal stripes can be measured from magnetic surveys at 
the ocean surface, while the ages of the reversals can be 
obtained by correlating the oceanic magnetic record with 
the radiometrically dated reversal sequence determined 
in subaerial lavas for about the last 4Ma. When the dis¬ 
tance of a given polarity reversal from the spreading axis 
is plotted against the age of the reversal, a nearly linear 
relationship is obtained (Fig. 1.15). The slope of the best 
fitting straight line gives the average half-rate of spread¬ 
ing at the ridge. These are of the order of 10 mm yr -1 in 
the North Atlantic ocean and 40-60mmyr _1 in the 
Pacific ocean. The calculation applies to the rate of 
motion of crust on one side of the ridge only. In most 
cases spreading has been symmetric on each side of the 
ridge (i.e., the opposite sides are moving away from the 
ridge at equal speeds), so the full rate of separation at a 
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Fig. 1.15 Computation of half-rates of sea-floor spreading at different 
spreading centers by measuring the distances to anomalies with known 
radiometric ages (after Vine, 1966). 


ridge axis is double the calculated half-rate of spreading 
(Fig. 1.11). 

The rates of current plate motion determined from 
axial anomaly patterns (Fig. 1.11) are average values over 
several million years. Modern geodetic methods allow 
these rates to be tested directly (see Section 2.4.6). Satellite 
laser-ranging (SLR) and very long baseline interferometry 
(VLBI) allow exceptionally accurate measurement of 
changes in the distance between two stations on Earth. 
Controlled over several years, the distances between pairs 
of stations on opposite sides of the Atlantic ocean are 
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Fig. 1.16 Changes in 
separation between Westcott 
(Massachusetts, USA) and (a) 
Onsala (Sweden) and (b) 
Wettzell (Germany), as 
determined by very long 
baseline interferometry (after 
Ryan etal., 1993). 


(a) North America - Sweden (Onsala) 



Year 


(b) North America - Germany (Wettzell) 



Year 


increasing slowly at a mean rate of 17 mm yr -1 (Fig. 1.16). 
This figure is close to the long-term value of about 
20 mm yr -1 interpreted from model NUVEL-1 of current 
plate motions (Fig. 1.11). 

Knowing the spreading rates at ocean ridges makes it 
possible to date the ocean floor. The direct correlation 
between polarity sequences measured in continental lavas 
and derived from oceanic anomalies is only possible for 
the last 4 Ma or so. Close to the axial zone, where linear 
spreading rates are observed (Fig. 1.15), simple extrapola¬ 
tion gives the ages of older anomalies, converting the 
striped pattern into an age map (Fig. 1.13). Detailed mag¬ 
netic surveying of much of the world's oceans has 
revealed a continuous sequence of anomalies since the 
Late Cretaceous, preceded by an interval in which no 
reversals occurred; this Quiet Interval was itself preceded 
by a Mesozoic reversal sequence. Magnetostratigraphy in 
sedimentary rocks (Section 5.7.4) has enabled the identifi¬ 
cation, correlation and dating of key anomalies. The 
polarity sequence of the oceanic anomalies has been con¬ 
verted to a magnetic polarity timescale in which each 
polarity reversal is accorded an age (e.g., as in Fig. 5.78). 
In turn, this allows the pattern of magnetic anomalies in 


the ocean basins to be converted to a map of the age of the 
ocean basins (Fig. 5.82). The oldest areas of the oceans lie 
close to northwest Africa and eastern North America, as 
well as in the northwest Pacific. These areas formed 
during the early stages of the breakup of Pangaea. They 
are of Early Jurassic age. The ages of the ocean basins 
have been confirmed by drilling through the sediment 
layers that cover the ocean floor and into the underlying 
basalt layer. Beginning in the late 1960s and extending 
until the present, this immensely expensive undertaking 
has been carried out in the Deep Sea Drilling Project 
(DSDP) and its successor the Ocean Drilling Project 
(ODP). These multinational projects, under the leader¬ 
ship of the United States, are prime examples of open sci¬ 
entific cooperation on an international scale. 

1.2.6 Plate margins 

It is important to keep in mind that the tectonic plates are 
not crustal units. They involve the entire thickness of the 
lithosphere, of which the crust is only the outer skin. 
Oceanic lithosphere is thin close to a ridge axis, but thick¬ 
ens with distance from the ridge, reaching a value of 



26 


The Earth as a planet 


Fig. 1.17 Hypothetical 
vertical cross-section through 
a lithospheric plate from a 
spreading center to a 
subduction zone. 


Subduction Zone 



80-100 km; the oceanic crust makes up only the top 
5-10 km. Continental lithosphere may be up to 150 km 
thick, of which only the top 30-60 km is continental 
crust. Driven by mechanisms that are not completely 
understood, the lithospheric plates move relative to each 
other across the surface of the globe. This knowledge 
supplies the “missing link” in Wegener’s continental drift 
hypothesis, removing one of the most serious objections 
to it. It is not necessary for the continents to plow 
through the rigid ocean basins; they are transported pas¬ 
sively on top of the moving plates, as logs float on a 
stream. Continental drift is thus a consequence of plate 
motions. 

The plate tectonic model involves the formation of new 
lithosphere at a ridge and its destruction at a subduction 
zone (Fig. 1.17). Since the mean density of oceanic lithos¬ 
phere exceeds that of continental lithosphere, oceanic 
lithosphere can be subducted under continental or oceanic 
lithosphere, whereas continental lithosphere cannot under¬ 
ride oceanic lithosphere. Just as logs pile up where a stream 
dives under a surface obstacle, a continent that is trans¬ 
ported into a subduction zone collides with the deep-sea 
trench, island arc or adjacent continent. Such a collision 
results in an orogenic belt. In a continent-continent colli¬ 
sion, neither plate can easily subduct, so relative plate 
motion may come to a halt. Alternatively, subduction may 
start at a new location behind one of the continents, leaving 
a mountain chain as evidence of the suture zone between 
the original colliding continents. The Alpine-Himalayan 
and Appalachian mountain chains are thought to have 
formed by this mechanism, the former in Tertiary times, the 
latter in several stages during the Paleozoic. Plate tectonic 
theory is supported convincingly by an abundance of geo¬ 
physical, petrological and geological evidence from the 
three types of plate margin. A brief summary of the main 
geophysical observations at these plate margins is given in 
the following sections. Later chapters give more detailed 
treatments of the gravity (Section 2.6.4), seismicity 
(Sections 3.5.3 and 3.5.4), geothermal (Section 4.2.5) and 
magnetic (Section 5.7.3) evidence. 


1.2.6.1 Constructive margins 

Although the ridges and rises are generally not centrally 
located in the ocean basins, they are often referred to 
as mid-ocean ridges. The type of oceanic basalt that 
is produced at an oceanic spreading center is even 
called a mid-ocean ridge basalt (MORB for short). 
Topographically, slow-spreading ridges have a distinct 
axial rift valley, which, for reasons that are not under¬ 
stood, is missing on faster-spreading ridges. Partially 
molten upper mantle rocks (generally assumed to be peri- 
dotites) from the asthenosphere rise under the ridges. The 
decrease in pressure due to the changing depth causes 
further melting and the formation of basaltic magma. 
Their chemical compositions and the concentrations of 
long-lived radioactive isotopes suggest that MORB lavas 
are derived by fractionation (i.e., separation of compo¬ 
nents, perhaps by precipitation or crystallization) from 
the upwelling peridotitic mush. Differentiation is thought 
to take place at about the depth of the lower crustal gab- 
broic layer beneath the ridge in a small, narrow magma 
chamber. Some of the fluid magma extrudes near the 
central rift or ridge axis and flows as lava across the ocean 
floor; part is intruded as dikes and sills into the thin 
oceanic crust. The Vine-Matthews-Morley hypothesis 
for the origin of oceanic magnetic anomalies requires 
fairly sharp boundaries between alternately magnetized 
blocks of oceanic crust. This implies that the zone of dike 
injection is narrow and close to the ridge axis. 

The distribution of earthquakes defines a narrow band 
of seismic activity close to the crest of an oceanic ridge. 
These earthquakes occur at shallow depths of a few kilo¬ 
meters and are mostly small; magnitudes of 6 or greater 
are rare. The seismic energy released at ridges is an 
insignificant part of the world-wide annual release. 
Analyses show that the earthquakes are associated with 
normal faulting, implying extension away from the ridge 
axis (see Section 3.5.4). 

Heat flow in the oceans is highest at the ocean ridges 
and decreases systematically with distance away from the 
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ridge. The thermal data conform to the model of sea¬ 
floor spreading. High axial values are caused by the for¬ 
mation of new lithosphere from the hot uprising magma 
at the ridge axis. The associated volcanism on the floor 
of the axial rift zones has been observed directly from 
deep-diving submersibles. With time, the lithosphere 
spreads away from the ridge and gradually cools, so that 
the heat outflow diminishes with increasing age or dis¬ 
tance from the ridge. 

Oceanic crust is thin, so the high-density mantle rocks 
occur at shallower depths than under the continents. This 
causes a general increase of the Earth’s gravity field over 
the oceans, giving positive gravity anomalies. However, 
over the ridge systems gravity decreases toward the axis so 
that a “negative” anomaly is superposed on the normally 
positive oceanic gravity anomaly. The effect is due to the 
local density structure under the ridge. It has been inter¬ 
preted in terms of anomalous mantle material with 
density slightly less than normal. The density is low 
because of the different mantle composition under the 
ridges and its high temperature. 

The interpretation of magnetic anomalies formed by 
sea-floor spreading at constructive margins has already 
been discussed. The results provide direct estimates of 
the mean rates of plate motions over geological time 
intervals. 

1.2.6.2 Destructive margins 

Subduction zones are found where a plate plunges 
beneath its neighbor to great depths, until pressure and 
temperature cause its consumption. This usually happens 
within a few hundred kilometers, but seismic tomography 
(Section 3.7.6) has shown that some descending slabs may 
sink to great depths, even to the core-mantle boundary. 
Density determines that the descending plate at a subduc¬ 
tion zone is an oceanic one. The surface manifestation 
depends on the type of overriding plate. When this is 
another oceanic plate, the subduction zone is marked by a 
volcanic island arc and, parallel to it, a deep trench. The 
island arc lies near the edge of the overriding plate and is 
convex toward the underriding plate. The trench marks 
where the underriding plate turns down into the mantle 
(Fig. 1.17). It may be partly filled with carbonaceous and 
detrital sediments. Island arc and trench are a few 
hundred kilometers apart. Several examples are seen 
around the west and northwest margins of the Pacific 
plate (Fig. 1.11). Melting of the downgoing slab produces 
magma that rises to feed the volcanoes. 

The intrusion of magma behind an island arc pro¬ 
duces a back-arc basin on the inner, concave side of the 
arc. These basins are common in the Western Pacific. If 
the arc is close to a continent, the off-arc magmatism may 
create a marginal sea, such as the Sea of Japan. Back-arc 
basins and marginal seas are floored by oceanic crust. 

A fine example of where the overriding plate is a conti¬ 
nental one is seen along the west coast of South America. 


a b c d 



Fig. 1.18 Stresses acting on a subducting lithospheric plate. Arrows 
indicate shear where the underriding plate is bent downward. Solid and 
open circles within the descending slab denote extension and 
compression, respectively; the size of the circle represents qualitatively 
the seismic activity. In (a), (b) and (d) extensional stress in the upper part 
of the plate is due to the slab being pulled into low-strength 
asthenosphere. In (b) resistance of the more rigid layer under the 
asthenosphere causes compression within the lower part of the slab; if 
the plate sinks far enough, (c), the stress becomes compressional 
throughout; in some cases, (d), the deep part of the lower slab may 
break off (after Isacks and Molnar, 1969). 

Compression between the Nazca and South American 
plates has generated the Andes, an arcuate-folded moun¬ 
tain belt near the edge of the continental plate. Active vol¬ 
canoes along the mountain chain emit a type of lava, called 
andesite, which has a higher silica content than oceanic 
basalt. It does not originate from the asthenosphere type of 
magma. A current theory is that it may form by melting of 
the subducting slab and overriding plate at great depths. If 
some siliceous sediments from the deep-sea trench are 
carried down with the descending slab, they might enhance 
the silica content of the melt, producing a magma with 
andesite-type composition. 

The seismicity at a subduction zone provides the key to 
the processes active there. Where one plate is thrust over 
the other, the shear causes hazardous earthquakes at 
shallow depths. Below this region, earthquakes are sys¬ 
tematically distributed within the subducting plate. They 
form an inclined Wadati-Beniojf seismic zone, which may 
extend for several hundred kilometers into the mantle. 
The deepest earthquakes have been registered down to 
about 700 km. 

Studies of the focal mechanisms (Section 3.5.4) show 
that at shallow depths the downgoing plate is in a state of 
down-dip extension (Fig. 1.18a). Subducting lithosphere 
is colder and denser than the underlying asthenosphere. 
This gives it negative buoyancy, which causes it to sink, 
pulling the plate downward. At greater depths the mantle 
is more rigid than the asthenosphere, and its strength 
resists penetration (Fig. 1.18b). While the upper part is 
sinking, the bottom part is being partly supported by the 
deeper layers; this results in down-dip compression in the 
lower part of the descending slab and down-dip extension 
in the upper part. A gap in the depth distribution of seis¬ 
micity may arise where the deviatoric stress changes from 
extensional to compressional. In a very deep subduction 
zone the increase in resistance with depth causes down- 
dip compression throughout the descending slab (Fig. 
1.18c). In some cases part of the slab may break off and 
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sink to great depths, where the earthquakes have com- 
pressional-type mechanisms (Fig. 1.18d); a gap in seis¬ 
micity exists between the two parts of the slab. 

Heat flow at a destructive plate margin reflects to some 
extent the spreading history of the plate. The plate 
reaches its maximum age, and so has cooled the furthest, 
by the time it reaches a subduction zone. The heat flow 
values over deep ocean basins are uniformly low, but the 
values measured in deep-sea trenches are the lowest found 
in the oceans. In contrast, volcanic arcs and back-arc 
basins often have anomalously high heat flow due to the 
injection of fresh magma. 

Gravity anomalies across subduction zones have 
several distinctive features. Seaward of the trench the 
lithosphere flexes upward slightly before it begins its 
descent, causing a weak positive anomaly; the presence of 
water or low-density sediments in a deep-sea trench gives 
rise to a strong negative gravity anomaly; and over the 
descending slab a positive anomaly is observed, due in 
part to the mineralogical conversion of subducted 
oceanic crust to higher-density eclogite. 

Subduction zones have no particular magnetic signa¬ 
ture. Close to an active or passive continental margin the 
contrast between the magnetic properties of oceanic and 
continental crust produces a magnetic anomaly, but this 
is not a direct result of the plate tectonic processes. Over 
marginal basins magnetic anomalies are not lineated 
except in some rare cases. This is because the oceanic 
crust in the basin does not originate by sea-floor spread¬ 
ing at a ridge, but by diffuse intrusion throughout the 
basin. 

1.2.6.3 Conservative margins 

Transform faults are strike-slip faults with steeply 
dipping fault planes. They may link segments of subduc¬ 
tion zones, but they are mostly observed at constructive 
plate margins where they connect oceanic ridge segments. 
Transform faults are the most seismically active parts of a 
ridge system, because here the relative motion between 
neighboring plates is most pronounced. Seismic studies 
have confirmed that the displacements on transform 
faults agree with the relative motion between the adjacent 
plates. 

The trace of a transform fault may extend away from a 
ridge on both sides as a fracture zone. Fracture zones are 
among the most dramatic features of ocean-floor topog¬ 
raphy. Although only some tens of kilometers wide, a 
fracture zone can be thousands of kilometers long. It 
traces the arc of a small circle on the surface of the globe. 
This important characteristic allows fracture zones to be 
used for the deduction of relative plate motions, which 
cannot be obtained from the strike of a ridge or trench 
segment, where oblique spreading or subduction is 
possible (note, for example, the direction of plate conver¬ 
gence relative to the strike of the Aleutian island arc in 
Fig. 1.11). 


Any displacement on the surface of a sphere is equiv¬ 
alent to a small rotation about a pole. The motion of one 
plate relative to the other takes place as a rotation about 
the Euler pole of relative rotation between the plates (see 
Section 1.2.9). This pole can be located from the orienta¬ 
tions of fracture zones, because the strike of a transform 
fault is parallel to the relative motion between two adja¬ 
cent plates. Thus a great circle normal to a transform 
fault or fracture zone must pass through the Euler pole 
of relative rotation between the two plates. If several 
great circles are drawn at different places on the 
fracture zone (or normal to different transform faults 
offsetting a ridge axis) they intersect at the Euler pole. 
The current model of relative plate motions NUVEL-1 
was obtained by determining the Euler poles of rotation 
between pairs of plates using magnetic anomalies, the 
directions of slip on earthquake fault planes at plate 
boundaries, and the topography that defines the strikes 
of transform faults. The rates of relative motion at 
different places on the plate boundaries (Fig. 1.11) were 
computed from the rates of rotation about the appropri¬ 
ate Euler poles. 

There may be a large change in elevation across a frac¬ 
ture zone; this is related to the different thermal histories 
of the plates it separates. As a plate cools, it becomes 
more dense and less buoyant, so that it gradually sinks. 
Consequently, the depth to the top of the oceanic lithos¬ 
phere increases with age, i.e., with distance from the 
spreading center. Places facing each other across a trans¬ 
form fault are at different distances from their respective 
spreading centers. They have different ages and so have 
subsided by different amounts relative to the ridge. This 
may result in a noticeable elevation difference across the 
fracture zone. 

Ultrabasic rocks are found in fracture zones and there 
may be local magnetic anomalies. Otherwise, the mag¬ 
netic effect of a transform fault is to interrupt the oceanic 
magnetic lineations parallel to a ridge axis, and to offset 
them in the same measure as ridge segments. This results 
in a very complex pattern of magnetic lineations in some 
ocean basins (e.g., in the northeast Pacific). 

A transform fault can also connect subduction zones. 
Suppose a consuming plate boundary consisted origi¬ 
nally of two opposed subduction zones (Fig. 1.19a). Plate 
Y is consumed below plate X along the segment ab of the 
boundary, whereas plate X is consumed beneath plate Y 
along segment be. The configuration is unstable, because 
a trench cannot sustain subduction in opposite directions. 
Consequently, a dextral transform fault develops at the 
point b. After some time, motion on the fault displaces 
the lower segment to the position b'c (Fig. 1.19b). An 
example of such a transform boundary is the Alpine fault 
in New Zealand (Fig. 1.19c). To the northeast of North 
Island, the Pacific plate is being subducted at the 
Tonga-Kermadec trench. To the southwest of South 
Island, the Pacific plate overrides the Tasman Sea at the 
anomalous Macquarie Ridge (earthquake analysis has 
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Fig. 1.19 (a) A consuming plate boundary consisting of two opposed 
subduction zones; along ab plate Y is consumed below plate X and 
along be plate X is consumed beneath plate Y. (b) Development of a 
transform fault which displaces be to the position b'e. (c) The Alpine 
fault in New Zealand is an example of such a transform boundary (after 
McKenzie and Morgan, 1969). 

shown that the plate margin at this ridge is compressive; 
the compression may be too slow to allow a trench to 
develop). The Alpine fault linking the two opposed sub¬ 
duction zones is therefore a dextral transform fault. 

1.2.7 Triple junctions 

It is common, although imprecise, to refer to a plate 
margin by its dominant topographic feature, rather than 
by the nature of the margin. A ridge (R) represents a con¬ 
structive margin or spreading center, a trench (T) refers to 
a destructive margin or subduction zone, and a transform 
fault (F) stands for a conservative margin. Each margin is 
a location where two tectonic plates adjoin. Inspection of 
Fig. 1.11 shows that there are several places where three 
plates come together, but none where four or more plates 
meet. The meeting points of three plate boundaries are 
called triple junctions. They are important in plate tecton¬ 
ics because the relative motions between the plates that 
form a triple junction are not independent. This may be 
appreciated by considering the plate motions in a small 
plane surrounding the junction. 

Consider the plate velocities at an RTF junction 
formed by all three types of boundary (Fig. 1.20a). If the 
plates are rigid, their relative motions take place entirely at 
their margins. Let A V B denote the velocity of plate B rela¬ 
tive to plate A, B V C the velocity of plate C relative to plate 
B, and C V A the velocity of plate A relative to plate C. Note 
that these quantities are vectors; their directions are as 


(a) (b) 





ridge transform subduction 
fault zone (trench) 


Fig. 1.20 (a) Triple junction formed by a ridge, trench and transform 
fault, and (b) vector diagram of the relative velocities at the three 
boundaries (after McKenzie and Parker, 1967). 

important as their magnitudes. They can be represented 
on a vector diagram by straight lines with directions paral¬ 
lel to and lengths proportional to the velocities. In a circuit 
about the triple junction an observer must return to the 
starting point. Thus, a vector diagram of the interplate 
velocities is a closed triangle (Fig. 1.20b). The velocities 
are related by 

aV b + bV c + cVa = 0 (1-7) 

This planar model is a “flat Earth” representation. As 
discussed in Section 1.2.9, displacements on the surface 
of a sphere are rotations about Euler poles of relative 
motion. This can be taken into account by replacing each 
linear velocity V in Eq. (1.7) by the rotational velocity to 
about the appropriate Euler pole. 

1.2.7.1 Stability of triple junctions 

The different combinations of three plate margins define 
ten possible types of triple junction. The combinations 
correspond to all three margins being of one type (RRR, 
TTT, FFF), two of the same type and one of the other 
(RRT, RRF, FFT, FFR, TTR, TTF), and all different 
(RTF). Different combinations of the sense of subduc¬ 
tion at a trench increase the number of possible junctions 
to sixteen. Not all of these junctions are stable in time. 
For a junction to preserve its geometry, the orientations 
of the three plate boundaries must fulfil conditions which 
allow the relative velocities to satisfy Eq. (1.7). If they do 
so, the junction is stable and can maintain its shape. 
Otherwise, the junction is unstable and must evolve in 
time to a stable configuration. 

The stability of a triple junction is assessed by consid¬ 
ering how it can move along any of the plate boundaries 
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Fig. 1.21 Plate margin geometry (left) and locus ab of a triple junction 
in velocity space (right) for (a) a trench, (b) a transform fault, and (c) a 
ridge (after Cox and Hart, 1986). 
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Fig. 1.22 Triple junction configuration (left), velocity lines of each 
margin in velocity space (center), and stability criteria (right) for selected 
triple junctions, TJ (after Cox and Hart, 1986). 


that form it. The velocity of a plate can be represented by 
its coordinates in velocity space. Consider, for example, a 
trench or consuming plate margin (Fig. 1.21a). The point 
A in velocity space represents the consuming plate, which 
has a larger velocity than B for the overriding plate. A 
triple junction in which one plate margin is a trench can lie 
anywhere on this boundary, so the locus of its possible 
velocities is a line ab parallel to the trench. The trench is 
fixed relative to the overriding plate B, so the line ab must 
pass through B. Similar reasoning shows that a triple junc¬ 
tion on a transform fault is represented in velocity space 
by a line ab parallel to the fault and passing through both 
A and B (Fig. 1.21b). A triple junction on a ridge gives a 
velocity line ab parallel to the ridge; in the case of symmet¬ 
rical spreading normal to the trend of the ridge the line ab 
is the perpendicular bisector of AB (Fig. 1.21c). 

Now consider the RRR-type of triple junction, formed 
by three ridges (Fig. 1.22a). The locus of the triple junction 
on the ridge between any pair of plates is the perpendicular 
bisector of the corresponding side of the velocity triangle 
ABC. The perpendicular bisectors of the sides of a triangle 
always meet at a point (the circumcenter). In velocity space 
this point satisfies the velocities on all three ridges simulta¬ 
neously, so the RRR triple junction is always stable. 
Conversely, a triple junction formed by three intersecting 
transform faults (FFF) is always unstable, because the 


velocity lines form the sides of a triangle, which can never 
meet in a point (Fig. 1.22b). The other types of triple junc¬ 
tion are conditionally stable, depending on the angles 
between the different margins. For example, in an RTF 
triple junction the velocity lines of the trench ac and trans¬ 
form fault be must both pass through C, because this plate 
is common to both boundaries. The junction is stable if the 
velocity line ab of the ridge also passes through C, or if the 
trench and transform fault have the same trend (Fig. 
1.22c). By similar reasoning, the FFT triple junction is 
only stable if the trench has the same trend as one of the 
transform faults (Fig. 1.22d). 

In the present phase of plate tectonics only a few of the 
possible types of triple junction appear to be active. An 
RRR-type is formed where the Galapagos Ridge meets 
the East Pacific Rise at the junction of the Cocos, Nazca 
and Pacific plates. A TTT-type junction is formed by the 
Japan trench and the Bonin and Ryukyu arcs. The San 
Andreas fault in California terminates in an FFT-type 
junction at its northern end, where it joins the Mendocino 
Fracture Zone. 

1.2.7.2 Evolution of triple junctions in the northeast Pacific 

Oceanic magnetic anomalies in the northeast Pacific form 
a complex striped pattern. The anomalies can be identified 
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by interpreting their shapes. Their ages can be found by 
comparison with a geomagnetic polarity timescale such as 
that shown in Fig. 5.78, which gives the age of each num¬ 
bered chron since the Late Jurassic. In the northeast 
Pacific the anomalies become younger toward the North 
American continent in the east, and toward the Aleutian 
trench in the north. The anomaly pattern produced at a 
ridge is usually symmetric (as in Fig. 1.13), but in the 
northeast Pacific only the western half of an anomaly 
pattern is observed. The plate on which the eastern half of 
the anomaly pattern was formed is called the Farallon 
plate. It and the ridge itself are largely missing and have 
evidently been subducted under the American plate. Only 
two small remnants of the Farallon plate still exist: the 
Juan de Fuca plate off the coast of British Columbia, and 
the Rivera plate at the mouth of the Gulf of California. 
The magnetic anomalies also indicate that another plate, 
the Kula plate, existed in the Late Mesozoic but has now 
been entirely consumed under Alaska and the Aleutian 
trench. The anomaly pattern shows that in the Late 
Cretaceous the Pacific, Kula and Farallon plates were 
diverging from each other and thus met at an RRR-type 
triple junction. This type of junction is stable and pre¬ 
served its shape during subsequent evolution of the plates. 
It is therefore possible to reconstruct the relative motions 
of the Pacific, Kula and Farallon plates in the Cenozoic 
(Fig. 1.23a-c). 

The anomaly ages are known from the magnetic 
timescale so the anomaly spacing allows the half-rates of 
spreading to be determined. In conjunction with the trends 
of fracture zones, the anomaly data give the rates and 
directions of spreading at each ridge. The anomaly pattern 
at the mouth of the Gulf of California covers the last 4 Ma 
and gives a mean half-rate of spreading of 3 cm yr -1 paral¬ 
lel to the San Andreas fault. This indicates that the Pacific 
plate has moved northward past the American plate at this 
boundary with a mean relative velocity of about 6 cm yr -1 
during the last 4Ma. The half-rate of spreading on the 
remnant of the Farallon-Pacific ridge is 5 cm yr -1 , giving 
a relative velocity of 10 cm yr -1 between the plates. A 
vector diagram of relative velocities at the Farallon- 
Pacific-American triple junction (Fig. 1.23d) shows con¬ 
vergence of the Farallon plate on the American plate at a 
rate of 7 cm yr -1 . Similarly, the spacing of east-west trend¬ 
ing magnetic anomalies in the Gulf of Alaska gives the 
half-rate of spreading on the Kula-Pacific ridge, from 
which it may be inferred that the relative velocity between 
the plates was 7 cm yr -1 . A vector diagram combining 
this value with the 6 cm yr -1 northward motion of the 
Pacific plate gives a velocity of 12 cm yr -1 for the Kula 
plate relative to the American plate. 

Using these velocities the history of plate evolution in 
the Cenozoic can be deduced by extrapolation. The inter¬ 
pretation is tenuous, as it involves unverifiable assump¬ 
tions. The most obvious is that the Kula-Pacific motion 
in the late Cretaceous (80 Ma ago) and the American- 
Pacific motion of the past 4Ma have remained constant 


(a) 20 Ma 




Fig. 1.23 (a)-(c) Extrapolated plate relationships in the northeast Pacific 
at different times in the Cenozoic (after Atwater, 1970). Letters on the 
American plate give approximate locations of some modern cities for 
reference: MC, Mexico City; LA, Los Angeles; SF, San Francisco; S, 
Seattle; A, Anchorage. The shaded area in (a) is an unacceptable 
overlap, (d) Vector diagrams of the relative plate velocities at the 
Kula-Pacific-American and Farallon-Pacific-American triple junctions 
(numbers are velocities in cm yr' relative to the American plate). 


throughout the Cenozoic. With this proviso, it is evident 
that triple junctions formed and migrated along the 
American plate margin. The Kula-American-Farallon 
RTF junction was slightly north of the present location of 
San Francisco 60 Ma ago (Fig. 1.23c); it moved to a posi¬ 
tion north of Seattle 20 Ma ago (Fig. 1.23a). Around that 
time in the Oligocene an FFT junction formed between 
San Francisco and Los Angeles, while the Farallon- 
Pacific-American RTF junction evolved to the south. The 
development of these two triple junctions is due to the col¬ 
lision and subduction of the Farallon-Pacific ridge at the 
Farallon- American trench. 

At the time of magnetic anomaly 13, about 34 Ma ago, 
a north-south striking ridge joined the Mendocino and 
Murray transform faults as part of the Farallon-Pacific 
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Fig. 1.24 Formation of the 
San Andreas fault as a result 
of the evolution of triple 
junctions in the northeast 
Pacific during the Oligocene: 
plate geometries at the times 
of (a) magnetic anomaly 13, 
about 34Ma ago, (b) anomaly 
9, about 27Ma ago (after 
McKenzie and Morgan, 

1969), and (c) further 
development when the 
Murray fracture zone collides 
with the trench. Double¬ 
headed arrows show 
directions of migration of 
triple junctions 1 and 2 along 
the consuming plate margin. 
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plate margin to the west of the American trench (Fig. 
1.24a). By the time of anomaly 9, about 27 Ma ago, the 
ridge had collided with the trench and been partly con¬ 
sumed by it (Fig. 1.24b). The Farallon plate now con¬ 
sisted of two fragments: an FFT junction developed at 
point 1, formed by the San Andreas fault system, the 
Mendocino fault and the consuming trench to the north; 
and an RTF junction formed at point 2. Both junctions 
are stable when the trenches are parallel to the transform 
fault along the San Andreas system. Analysis of the 
velocity diagrams at each triple junction shows that point 
1 migrated to the northwest and point 2 migrated to the 
southeast at this stage. Later, when the southern segment 
of the Farallon-Pacific ridge had been subducted under 
the American plate, the Murray transform fault changed 
the junction at point 2 to an FFT junction, which has sub¬ 
sequently also migrated to the northwest. 

1.2.8 Hotspots 

In 1958 S. W. Carey coined the term "hot spot” - now often 
reduced to “hotspot” - to refer to a long-lasting center of 
surface volcanism and locally high heat flow. At one time 
more than 120 of these thermal anomalies were proposed. 
Application of more stringent criteria has reduced their 
number to about 40 (Fig. 1.25). The hotspots may occur on 
the continents (e.g., Yellowstone), but are more common in 
the ocean basins. The oceanic hotspots are associated with 
depth anomalies. If the observed depth is compared with 
the depth predicted by cooling models of the oceanic 
lithosphere, the hotspots are found to lie predominantly in 
broad shallow regions, where the lithosphere apparently 


swells upward. This elevates denser mantle material, which 
creates a mass anomaly and disturbs the geoid; the effect is 
partially mitigated by reduced density of material in the 
hot, rising plume. The geoid surface is also displaced by 
subduction zones. The residual geoid obtained by remov¬ 
ing the effects associated with cold subducting slabs shows 
a remarkable correlation with the distribution of hotspots 
(Fig. 1.25). The oceanic hotspots are found in conjunction 
with intraplate island chains, which provide clues to the 
origin of hotspots and allow them to be used for measur¬ 
ing geodynamic processes. 

Two types of volcanic island chains are important in 
plate tectonics. The arcuate chains of islands associated 
with deep oceanic trenches at consuming plate margins 
are related to the process of subduction and have an 
arcuate shape. Nearly linear chains of volcanic islands 
are observed within oceanic basins far from active plate 
margins. These intraplate features are particularly 
evident on a bathymetric map of the Pacific Ocean. The 
Hawaiian, Marquesas, Society and Austral Islands form 
subparallel chains that trend approximately perpendicu¬ 
lar to the axis of ocean-floor spreading on the East 
Pacific rise. The most closely studied is the Hawaiian 
Ridge (Fig. 1.26a). The volcanism along this chain 
decreases from present-day activity at the southeast, on 
the island of Hawaii, to long extinct seamounts and 
guyots towards the northwest along the Emperor 
Seamount chain. The history of development of the 
chain is typical of other linear volcanic island chains in 
the Pacific basin (Fig. 1.26b). It was explained in 1963 by 
J. T. Wilson, before the modem theory of plate tectonics 
was formulated. 
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Fig. 1.25 The global 
distribution of 41 hotspots 
and their relationship to the 
residual geoid obtained by 
correcting geoid heights 
(shown in meters above the 
reference ellipsoid) for the 
effects of cold subducting 
slabs (after Crough and Jurdy, 
1980). 
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Fig. 1.26 (a) The Hawaiian Ridge and Emperor Seamount volcanic 
chains trace the motion of the Pacific plate over the Hawaiian 
hotspot; numbers give the approximate age of volcanism; note the 
change in direction about 43 Ma ago (after Van Andel, 1992). 

(b) Sketch illustrating the formation of volcanic islands and 
seamounts as a lithospheric plate moves over a hotspot (based 
on Wilson, 1963). 


A hotspot is a long-lasting magmatic center rooted in 
the mantle below the lithosphere. A volcanic complex is 
built up above the magmatic source, forming a volcanic 
island or, where the structure does not reach sea-level, a 
seamount. The motion of the plate transports the island 
away from the hotspot and the volcanism becomes 
extinct. The upwelling material at the hotspot elevates the 
ocean floor by up to 1500 m above the normal depth of 
the ocean floor, creating a depth anomaly. As they move 
away from the hotspot the by now extinct volcanic islands 
sink beneath the surface; some are truncated by erosion to 
sea-level and become guyots. Coral atolls may accumulate 
on some guyots. The volcanic chain is aligned with the 
motion of the plate. 

Confirmation of this theory is obtained from 
radiometric dating of basalt samples from islands and 
seamounts along the Hawaiian Ridge portion of the 
Hawaiian-Emperor chain. The basalts increase in age 
with distance from the active volcano Kilauea on the 
island of Hawaii (Fig. 1.27). The trend shows that 
the average rate of motion of the Pacific plate over the 
Hawaiian hotspot has been about 10 cm yr -1 during the 
last 20^10 Ma. The change in trend between the Hawaiian 
Ridge and the Emperor Seamount chain indicates a 
change in direction and speed of the Pacific plate about 
43 Ma ago, at which time there was a global reorganiza¬ 
tion of plate motions. The earlier rate of motion along 
the Emperor chain is less well determined but is estimated 
to be about 6 cm yr -1 . 

Radiometric dating of linear volcanic chains in the 
Pacific basin gives almost identical rates of motion over 
their respective hotspots. This suggests that the hotspots 
form a stationary network, at least relative to the lithos¬ 
phere. The velocities of plate motions over the hotspots 
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Fig. 1.27 Age of basaltic 
volcanism along the Hawaiian 
Islands as a function of 
distance from the active 
volcano Kilauea (based on 
Dalrymple etal., 1977). 
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are therefore regarded as absolute velocities, in contrast 
to the velocities derived at plate margins, which are 
the relative velocities between neighboring plates. The 
assumption that the hotspots are indeed stationary has 
been contested by studies that have yielded rates of inter¬ 
hotspot motion of the order of 1.5-2 cm yr~' (compara¬ 
ble to present spreading rates in the Atlantic). Thus, the 
notion of a stationary hotspot reference frame may only 
be valid for a limited time interval. Nevertheless, any 
motions between hotspots are certainly much slower than 
the motions of plates, so the hotspot reference frame pro¬ 
vides a useful guide to absolute plate motions over the 
typical time interval (~10Ma) in which incremental sea¬ 
floor spreading is constant. 

As well as geophysical evidence there are geochemical 
anomalies associated with hotspot volcanism. The type 
of basalt extruded at a hotspot is different from the 
andesitic basalts formed in subduction zone magma- 
tism. It also has a different petrology from the mid- 
oceanic ridge basalts (MORB) formed during sea-floor 
spreading and characteristic of the ocean floor. The 
hotspot source is assumed to be a mantle plume that 
reaches the surface. Mantle plumes are fundamental fea¬ 
tures of mantle dynamics, but they remain poorly under¬ 
stood. Although they are interpreted as long-term 
features it is not known for how long they persist, or how 
they interact with convective processes in the mantle. 
Their role in heat transport and mantle convection, with 
consequent influence on plate motions, is believed to be 
important but is uncertain. Their sources are controver¬ 
sial. Some interpretations favor a comparatively shallow 


origin above the 670 km discontinuity, but the prevailing 
opinion appears to be that the plumes originate in the 
D" layer at the core-mantle boundary. This requires 
the mantle plume to penetrate the entire thickness of the 
mantle (see Fig. 4.38). In either case the stationary 
nature of the hotspot network relative to the lithosphere 
provides a reference frame for determining absolute 
plate motions, and for testing the hypothesis of true 
polar wander. 

1.2.9 Plate motion on the surface of a sphere 

One of the great mathematicians of the eighteenth 
century was Leonhard Euler (1707-1783) of Switzerland. 
He made numerous fundamental contributions to pure 
mathematics, including to complex numbers (see Box 
2.6) and spherical trigonometry (see Box 1.4). A coroll¬ 
ary of one of his theorems shows that the displacement 
of a rigid body on the surface of a sphere is equival¬ 
ent to a rotation about an axis that passes through its 
center. This is applicable to the motion of a lithospheric 
plate. 

Any motion restricted to the surface of a sphere takes 
place along a curved arc that is a segment of either a great 
circle (centered, like a “circle of longitude,” at the Earth’s 
center) or a small circle. Small circles are defined relative 
to a pole of rotational symmetry (such as the geographi¬ 
cal pole, when we define “circles of latitude”). A point on 
the surface of the sphere can be regarded as the end-point 
of a radius vector from the center of the Earth to the 
point. Any position on the spherical surface can be sped- 
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Box 1.4: Spherical trigonometry 


The sides of a triangle on a plane surface are straight 
lines and the sum of its internal angles is 180° (or tt 
radians). Let the angles be A, B and C and the lengths of 
the sides opposite each of these angles be a, b and c, as 
in Fig. B1.4a. The sizes of the angles and the lengths of 
the sides are governed by the sine law: 

sin A _ sin B _ sin C . 

The length of any side is related to the lengths of the 
other two sides and to the angle they include by the 
cosine law, which for the side a is 

a 2 = b 2 + c 2 — IbccosA (2) 

with similar expressions for the sides b and c. 

The sides of a triangle on a spherical surface are 
great circle arcs and the sum of the internal angles is 
greater than 180°. The angle between two great circles at 
their point of intersection is defined by the tangents to 
the great circles at that point. Let the angles of a spheri¬ 
cal triangle be A, B and C, and let the lengths of the 
sides opposite each of these angles be a, b and c, respec¬ 
tively, as in Fig. B1.4b. The lengths of the sides may be 
converted to the angles they subtend at the center of the 
Earth. For example, the distance from pole to equator 
on the Earth’s surface may be considered as 10,007 km 
or as 90 degrees of arc. Expressing the sides of the 
spherical triangle as angles of arc, the law of sines is 

sin A _ sinB _ sinC 

cosfl cos b cos c ' 

and the law of cosines is 




Fig. B1.4 The sides and angles of (a) a plane triangle, (b) a spherical 
triangle. 


cos a = cos b cosc + sin b sine cos A 


(4) 


fled by two angles, akin to latitude and longitude, or, 
alternatively, by direction cosines (Box 1.5). As a result of 
Euler’s theorem any displacement of a point along a 
small circle is equivalent to rotating the radius vector 
about the pole of symmetry, which is called the Euler pole 
of the rotation. A displacement along a great circle - the 
shortest distance between two points on the surface of the 
sphere - is a rotation about an Euler pole 90° away from 
the arcuate path. Euler poles were described in the discus¬ 
sion of conservative plate margins (Section 1.2.6.3); they 
play an important role in paleogeographic reconstruc¬ 
tions using apparent polar wander paths (see Section 
5.6.4.3). 

1.2.9.1 Euler poles of rotation 

Geophysical evidence does not in itself yield absolute 
plate motions. Present-day seismicity reflects relative 
motion between contiguous plates, oceanic magnetic 


anomaly patterns reveal long-term motion between 
neighboring plates, and paleomagnetism does not 
resolve displacements in longitude about a paleopole. 
The relative motion between plates is described by 
keeping one plate fixed and moving the other one rela¬ 
tive to it; that is, we rotate it away from (or toward) the 
fixed plate (Fig. 1.28). The geometry of a rigid plate on 
the surface of a sphere is outlined by a set of bounding 
points, which maintain fixed positions relative to each 
other. Provided it remains rigid, each point of a moving 
plate describes an arc of a different small circle about the 
same Euler pole. Thus, the motion between plates is 
equivalent to a relative rotation about their mutual Euler 
rotation pole. 

The traces of past and present-day plate motions are 
recorded in the geometries of transform faults and frac¬ 
ture zones, which mark, respectively, the present-day and 
earlier locations of conservative plate margins. A 
segment of a transform fault represents the local path of 
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Box 1.5: Direction cosines 


It is often useful to express a direction with the aid of 
direction cosines. These are the cosines of the angles that 
the direction makes with the reference axes. Define the 
z-axis along the Earth’s spin axis, the .v-axis along the 
Greenwich meridian and the v-axis normal to both of 
these, as in Fig. B1.5. If a line makes angles a, a and 
a, to the x-, y- and z-axes, respectively, its direction 
cosines with respect to these axes are 

l — cos a x m = cosa v n — cos a z (1) 

Consider a position P on the Earth’s surface with lat¬ 
itude A and longitude <fi. A line of length R from the 
center of the Earth to the point P has projections 
Rcosa, (= R sinA) on the z-axis and Rsina, (= i?cosA) 
in the equatorial plane. The latter has projections 
(i?cosAcos<^>) and (/?cosAsinr/>) on the .v- and v-axes, 
respectively. The direction cosines of the line are thus 

/= cosAcos 0 

/M=cosAsin0 (2) 

n = sinA 

The angle A between two lines with direction cosines 
(/j, m v n x ) and (/.,, m 2 , n 2 ) is given by 


spin 

axis 



cosA =/j/ 2 + »7j/;7 2 + ;7j7;, (3) 

These relationships are useful for computing great 
circle distances and the angular relationships between 
lines. 


Euler 

rotation 



Fig. 1.28 Illustration that the displacement of a rigid plate on the 
surface of a sphere is equivalent to the rotation of the plate about an 
Euler pole (after Morgan, 1968) 

relative motion between two plates. As such, it defines a 
small circle about the Euler pole of relative rotation 
between the plates. Great circles drawn normal to the 
strike of the small circle (transform fault) should meet at 
the Euler pole (Fig. 1.29a), just as at the present day 
circles of longitude are perpendicular to circles of lati¬ 
tude and converge at the geographic pole. In 1968, W. J. 


Morgan first used this method to locate the Euler rota¬ 
tion pole for the present-day plate motion between 
America and Africa (Fig. 1.29b). The Caribbean plate 
may be absorbing slow relative motion, but the absence 
of a well-defined seismic boundary between North and 
South America indicates that these plates are now 
moving essentially as one block. The great circles normal 
to transform faults in the Central Atlantic converge and 
intersect close to 58°N 36°W, which is an estimate of the 
Euler pole of recent motion between Africa and South 
America. The longitude of the Euler pole is determined 
more precisely than its latitude, the errors being ± 2° and 
± 5°, respectively. When additional data from earthquake 
first motions and spreading rates are included, an Euler 
pole at 62°N 36°W is obtained, which is within the error 
of the first location. 

The “Bullard-type fit” of the African and South 
American coastlines (Section 1.2.2.2) is obtained by a 
rotation about a pole at 44°N 31°W. This pole reflects 
the average long-term motion between the continents. A 
rotation which matches a starting point with an end¬ 
point is a finite rotation. As the difference between the 
present-day and age-averaged Euler poles illustrates, a 
finite rotation is a mathematical formality not necessarily 
related to the actual motion between the plates, which 
may consist of a number of incremental rotations about 
different poles. 
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Fig. 1.29 (a) Principle of the method for locating the Euler pole of 
rotation between two plates where great circles normal to transform 
faults on the plate boundary intersect (after Kearey and Vine, 1990). (b) 
Location of the Euler pole of rotation for the motion between Africa 
and South America, using transform faults on the Mid-Atlantic Ridge in 
the Central Atlantic (after Morgan, 1968). 


1.2.9.2 Absolute plate motions 

The axial dipole hypothesis of paleomagnetism states 
that the mean geomagnetic pole - averaged over several 
tens of thousands of years - agrees with the contempora¬ 
neous geographic pole (i.e., rotation axis). Paleomagnetic 
directions allow the calculation of the apparent pole 
position at the time of formation of rocks of a given age 
from the same continent. By connecting the pole posi¬ 
tions successively in order of their age, an apparent polar 
wander (APW) path is derived for the continent. Viewed 
from the continent it appears that the pole (i.e., the rota¬ 
tion axis) has moved along the APW path. In fact, the 
path records the motion of the lithospheric plate bearing 
the continent, and differences between APW paths for 
different plates reflect motions of the plates relative to 
each other. 

During the displacement of a plate (i.e., when it 
rotates about an Euler pole), the paleomagnetic pole 
positions obtained from rocks on the plate describe a 
trajectory which is the arc of a small circle about the 
Euler pole (Fig. 1.30). The motion of the plate over an 
underlying hotspot leaves a trace that is also a small circle 
arc about the same hotspot. The paleomagnetic record 
gives the motion of plates relative to the rotation axis, 



Fig. 1.30 Development of an arcuate apparent polar wander path and 
hotspot trace as small circles about the same Euler pole, when a mobile 
plate M moves relative to a fixed plate F (after Butler, 1992). 


whereas the hotspot record shows the plate motion over a 
fixed point in the mantle. If the mantle moves relative to 
the rotation axis, the network of hotspots - each believed 
to be anchored to the mantle - shifts along with it. This 
motion of the mantle deeper than the mobile lithosphere 
is called true polar wander (TPW). The term is rather a 
misnomer, because it refers to motion of the mantle rela¬ 
tive to the rotation axis. 

Paleomagnetism provides a means of detecting whether 
long-term true polar wander has taken place. It involves 
comparing paleomagnetic poles from hotspots with con¬ 
temporary poles from the stable continental cratons. 
Consider first the possibility that TPW does not take 
place: each hotspot maintains its position relative to the 
rotation axis. A lava that is magnetized at an active 
hotspot acquires a direction appropriate to the distance 
from the pole. If the plate moves from north to south over 
the stationary hotspot, a succession of islands and 
seamounts (Fig. 1.31a, A-D) is formed, which, indepen¬ 
dently of their age, have the same magnetization direction. 
Next, suppose that true polar wander does take place: each 
hotspot moves with time relative to the rotation axis. For 
simplicity, let the hotspot migration also be from north to 
south (Fig. 1.31b). Seamount A is being formed at present 
and its magnetization direction corresponds to the present- 
day distance from the pole. However, older seamounts B, C 
and D were formed closer to the pole and have progres¬ 
sively steeper inclinations the further south they are. The 
change in paleomagnetic direction with age of the volcan- 
ism along the hotspot trace is evidence for true polar 
wander. 

To test such a hypothesis adequately a large number of 
data are needed. The amount of data from a single plate, 
such as Africa, can be enlarged by using data from other 
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but that its amplitude has remained less than 15° for the 
last 150Ma. 



fixed 

hotspot 


(b) 


paleomagnetic inclination 



Hotspot position at age: 0 10 20 30 M a 


Fig. 1.31 Illustration of the effect of true polar wander on 
paleomagnetic inclination: (a) north-south plate motion over a 
stationary hotspot, (b) same plate motion over a north-south migrating 
hotspot. A, B, C and D are sequential positions. 


plates. For example, in reconstructing Gondwanaland, 
South America is rotated into a matching position with 
Africa by a finite rotation about an Euler pole. The same 
rotation applied to the APW path of South America 
allows data from both continents to be combined. 
Likewise, rotations about appropriate Euler poles make 
the paleomagnetic records for North America and 
Eurasia accessible. Averaging the pooled data for age- 
windows 10 Ma apart gives a reconstructed paleomag¬ 
netic APW path for Africa (Fig. 1.32a). The next step is to 
determine the motions of plates over the network of 
hotspots, assuming the hotspots have not moved relative 
to each other. A “hotspot” apparent polar wander path is 
obtained, which is the track of an axis in the hotspot ref¬ 
erence frame presently at the north pole. The appearance 
of this track relative to Africa is shown in Fig. 1.32b. 

We now have records of the motion of the lithosphere 
relative to the pole, and of the motion of the lithosphere 
relative to the hotspot reference frame. The records coin¬ 
cide for the present time, both giving pole positions at the 
present-day rotation axis, but they diverge with age as a 
result of true polar wander. A paleomagnetic pole of a 
given age is now moved along a great circle (i.e., rotated 
about an Euler pole in the equatorial plane) until it lies on 
the rotation axis. If the same rotation is applied to the 
hotspot pole of the same age, it should fall on the rotation 
axis also. The discrepancy is due to motion of the hotspot 
reference frame relative to the rotation axis. Joining loca¬ 
tions in order of age gives a true polar wander path (Fig. 
1.32c). This exercise can be carried out for only the last 
200 Ma, in which plate reconstructions can be confidently 
made. The results show that TPW has indeed taken place 


1.2.10 Forces driving plate tectonic motions 

An unresolved problem of plate tectonics is what mecha¬ 
nism drives plate motions. The forces acting on plates 
may be divided into forces that act on their bottom sur¬ 
faces and forces that act on their margins. The bottom 
forces arise due to relative motion between the lithos¬ 
pheric plate and the viscous asthenosphere. In this 
context it is less important whether mantle flow takes 
place by whole-mantle convection or layered convection. 
For plate tectonics the important feature of mantle rheol¬ 
ogy is that viscous flow in the upper mantle is possible. 
The motion vectors of lithospheric plates do not reveal 
directly the mantle flow pattern, but some general infer¬ 
ences can be drawn. The flow pattern must include the 
mass transport involved in moving lithosphere from a 
ridge to a subduction zone, which has to be balanced by 
return flow deep in the mantle. Interactions between the 
plates and the viscous substratum necessarily influence 
the plate motions. In order to assess the importance 
of these effects we need to compare them to the other 
forces that act on plates, especially at their boundaries 
(Fig. 1.33). 

1.2.10.1 Forces acting on lithospheric plates 

Some forces acting on lithospheric plates promote motion 
while others resist it. Upper mantle convection could fall 
into either category. The flow of material beneath a plate 
exerts a mantle drag force (F df ) on the base of the plate. If 
the convective flow is faster than plate velocities, the 
plates are dragged along by the flow, but if the opposite is 
true the mantle drag opposes the plate motion. Plate 
velocities are observed to be inversely related to the area 
of continent on the plate, which suggests that the greater 
lithospheric thickness results in an additional continental 
drag force (F CD ) on the plate. The velocity of a plate also 
depends on the length of its subduction zone but not on 
the length of its spreading ridge. This suggests that sub¬ 
duction forces may be more important than spreading 
forces. This can be evaluated by considering the forces at 
all three types of plate margin. 

At spreading ridges, upwelling magma is associated 
with the constructive margin. It was long supposed that 
this process pushes the plates away from the ridge. It also 
elevates the ridges above the oceanic abyss, so that poten¬ 
tial energy encourages gravitational sliding toward the 
trenches. Together, the two effects make up the ridge push 
force (F Rp ). 

At transform faults, high seismicity is evidence of inter¬ 
active forces where the plates move past each other. A 
transform force (F tf ) can be envisioned as representing 
frictional resistance in the contact zone. Its magnitude 
may be different at a transform connecting ridge segments, 
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Fig. 1.32 (a) Paleomagnetic 
APW path reconstructed for 
Africa using data from several 
plates, (b) Hotspot APW path 
(motion of an axis at the 
geographic pole relative to 
the hotspot reference frame), 
(c) Computed true polar 
wander path (based on data 
from Courtillot and Besse, 
1987, and Morgan, 1982). 
Values represent age in Ma. 



Fig. 1.33 Diagram illustrating 
some of the different forces 
acting on lithospheric plates 
(after Forsyth and Uyeda, 
1975; Uyeda, 1978). 


continental plate 



where the plates are hot, than at a transform between sub- 
duction zones, where the plates are cold. 

At subduction zones, the descending slab of lithosphere 
is colder and denser than the surrounding mantle. This 
creates a positive mass anomaly - referred to as negative 
buoyancy - which is accentuated by intraplate phase tran¬ 
sitions. If the descending slab remains attached to the 
surface plate, a slab pull force (T sp ) ensues that pulls the 
slab downwards into the mantle. Transferred to the entire 
plate it acts as a force toward the subduction zone. 
However, the subducting plate eventually sinks to depths 
where it approaches thermal equilibrium with the sur¬ 
rounding mantle, loses its negative buoyancy and experi¬ 
ences a slab resistance force (T SR ) as it tries to penetrate 
further into the stiffer mantle. 

Plate collisions result in both driving and resistive 
forces. The vertical pull on the descending plate may cause 


the bend in the lower plate to migrate away from the sub¬ 
duction zone, effectively drawing the upper plate toward 
the trench. The force on the upper plate has also been 
termed “ trench suction” (T su ). The colliding plates also 
impede each other’s motion and give rise to a collision- 
resistance force (F cb ). This force consists of separate 
forces due to the effects of mountains or trenches in the 
zone of convergence. 

At hotspots, the transfer of mantle material to the 
lithosphere may result in a hotspot force (F HS ) on the 
plate. 

In summary, the driving forces on plates are slab pull, 
slab suction, ridge push and the trench pull force on the 
upper plate. The motion is opposed by slab resistance, col¬ 
lision resistance, and transform fault forces. Whether the 
forces between plate and mantle (mantle drag, continental 
drag) promote or oppose motion depends on the sense of 
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Fig. 1.34 Comparison of the magnitudes of torques acting on the 12 
major lithospheric plates (after Chappie and Tullis, 1977). 


the relative motion between the plate and the mantle. The 
motive force of plate tectonics is clearly a composite of 
these several forces. Some can be shown to be more impor¬ 
tant than others, and some are insignificant. 


1.2.10.2 Relative magnitudes of forces driving plate 
motions 

In order to evaluate the relative importance of the forces 
it is necessary to take into account their different direc¬ 
tions. This is achieved by converting the forces to torques 
about the center of the Earth. Different mathematical 
analyses lead to similar general conclusions regarding the 
relative magnitudes of the torques. The push exerted by 
hotspots and the resistance at transform faults are negli¬ 
gible in comparison to the other forces (Fig. 1.34). The 
ridge push force is much smaller than the forces at a con¬ 
verging margin, and it is considered to be of secondary 
importance. Moreover, the topography of oceanic ridges 
is offset by transform faults. If the ridge topography were 
due to buoyant upwelling, the fluid mantle could not 
exhibit discontinuities at the faults but would bulge 
beyond the ends of ridge segments. Instead, sharp offsets 
are observed, indicating that the topography is an expres¬ 
sion of local processes in the oceanic lithosphere. This 
implies that upwelling at ridges is a passive feature, with 


mantle material filling space created by the plates moving 
apart. 

The torque analysis shows that the strongest force 
driving plate motions is the pull of a descending slab on its 
plate; the force that pulls the upper plate toward a trench 
may also be considerable. The opposing force due to the 
collision between the plates is consistently smaller than the 
upper plate force. The resistance experienced by some slabs 
to deep mantle penetration may diminish the slab pull 
force. However, seismic evidence has shown that some 
slabs may become detached from their parent plate, and 
apparently sink all the way to the core-mantle boundary. 
The descending motion contributes to mantle circulation, 
and thus acts indirectly as a driving force for plate motions; 
it is known as slab suction. However, analysis of this force 
has shown that it is less important than slab pull, which 
emerges as the most important force driving plate motions. 
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I. 4 REVIEW QUESTIONS 

1. Write down Kepler’s three laws of planetary motion. 
Which law is a result of the conservation of momen¬ 
tum? Which law is a result of the conservation of 
energy? 

2. The gravitational attraction of the Sun on an orbiting 
planet is equal to the centripetal acceleration of the 
planet. Show for a circular orbit that this leads to 
Kepler's third law of motion. 

3. What causes the precession of the Earth’s rotation 
axis? Why is it retrograde? 

4. What other long-term changes of the rotation axis or 
the Earth's orbit occur? What are the periods of these 
motions? What are their causes? 

5. If a planet existed in place of the asteroid belt, what 
would Bode’s law predict for the radius of its orbit? 
What would be the period of its orbital motion 
around the Sun? 

6. What is the nebular hypothesis for the origin of the 
solar system? 

7. What geological evidence is there in support of conti¬ 
nental drift! What is the essential difference between 
older models of continental drift and the modern 
theory of plate tectonics? 

8. What was Pangaea? When and how did it form? 
When and how did it break up? 

9. What is the Earth’s crust? What is the lithosphere? 
How are they distinguished? 

10. What are the major discontinuities in the Earth’s 
internal structure? How are they known? 

II. Distinguish between constructive, conservative and 
destructive plate margins. 

12. Make a brief summary, using appropriate sketches, of 
geological and geophysical data from plate margins 
and their plate tectonic interpretations. 

13. What kind of plate margin is a continental collision 
zone? How does it differ from a subduction zone? 

14. Describe the Vine-Matthews-Morley hypothesis of 
sea-floor spreading. 

15. Explain how sea-floor spreading can be used to 
determine the age of the oceanic crust. Where 
are the oldest parts of the oceans? How old are 
they? How does this age compare to the age of the 
Earth? 

16. What are the names of the 12 major tectonic plates 
and where do their plate margins lie? 

17. With the aid of a globe or map, estimate roughly a 
representative distance across one of the major 
plates. What is the ratio of this distance to the thick¬ 
ness of the plate? Why are the tectonic units called 
plates ? 

18. What is a triple junction? Explain the role of triple 
junctions in plate tectonics. 

19. What is a hotspot? Explain how the Hawaiian 
hotspot provides evidence of a change in motion of 
the Pacific plate. 


20. How may the Euler pole of relative rotation between 
two plates be located? 


1.5 EXERCISES 

1. Measured from a position on the Earth's surface at 
the equator, the angle between the direction to the 
Moon and a reference direction (distant star) in the 
plane of the Moon’s orbit is 11°57' at 8 p.m. one 
evening and 14°32' at 4 a.m. the following morning. 
Assuming that the Earth, Moon and reference star 
are in the same plane, and that the rotation axis is 
normal to the plane, estimate the approximate dis¬ 
tance between the centres of the Earth and Moon. 

2. The eccentricity e of the Moon’s orbit is 0.0549 and 
the mean orbital radius r L = ( ab) m is 384,100 km. 

(a) Calculate the lengths of the principal axes a and 
b of the Moon’s orbit. 

(b) How far is the center of the Earth from the center 
of the elliptical orbit? 

(c) Calculate the distances of the Moon from the 
Earth at perigee and apogee. 

3. If the Moon’s disk subtends a maximum angle of 
0° 31' 36.8" at the surface of the Earth, what is the 
Moon's radius? 

4. Bode’s Law (Eq. (1.3)) gives the orbital radius of the 
nth planet from the Sun (counting the asteroid belt) 
in astronomical units. It fits the observations well 
except for Neptune (n = 9 ) and Pluto (n = 10 ). 
Calculate the orbital radii of Neptune and Pluto pre¬ 
dicted by Bode’s Law, and compare the results with 
the observed values (Table 1.2). Express the discrep¬ 
ancies as percentages of the predicted distances. 

5. An ambulance passes a stationary observer at the 
side of the road at a speed of 60 km h -1 . Its dual tone 
siren emits alternating tones with frequencies of 700 
and 1700 Hz. What are the dual frequencies heard by 
the observer (a) before and (b) after the ambulance 
passes? [Assume that the speed of sound, c, in m s -1 
at the temperature T (°C) is c— 331 + 0.607T.] 

6. A spacecraft landing on the Moon uses the Doppler 
effect on radar signals transmitted at a frequency 
of 5 GHz to determine the landing speed. The pilot 
discovers that the precision of the radar instrument 
has deteriorated to ±100 Hz. Is this adequate to 
ensure a safe landing? [Speed of light 300,000 km s -1 .] 

7. Explain with the aid of a sketch the relationship 
between the length of a day and the length of a year 
on the planet Mercury (see Section 1.1.3.2). 

8. The rotations of the planet Pluto and its moon 
Charon about their own axes are synchronous with 
the revolution of Charon about Pluto. Show with the 
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aid of simple sketches that Pluto and Charon always 
present the same face to each other. 

9. The barycenter of a star and its planet - or of a 
planet and its moon - is the center of mass of the 
pair. Using the mass and radius of primary body and 
satellite, and the orbital radius of the satellite, as 
given in Tables 1.1-1.3 or below, calculate the loca¬ 
tion of the barycenter of the following pairs of 
bodies. In each case, does the barycenter lie inside or 
outside the primary body? 

(a) Sun and Earth. 

(b) Sun and Jupiter. 

(c) Earth and Moon. 

(d) Pluto (mass 1.27 X 10 22 kg, radius 1137 km) and 
Charon (mass 1.9 X 10 21 kg, radius 586 km); the 
radius of Charon's orbit is 19,640 km. 

10. A planet with radius R has a mantle with uniform 
density p m enclosing a core with radius r c and 
uniform density p Q . Show that the mean density of 
the planet p is given by 

p-p m = (Vf 

Pc - Pm \R) 

11. The radius of the Moon is 1738 km and its mean 
density is 3347 kg m -3 . If the Moon has a core 
with radius 400 km and the uniform density of the 
overlying mantle is 3300 kg m -3 , what is the density 
of the core? 

12. Summarize the geological and geophysical evidence 
resulting from plate tectonic activity in the following 
regions: (a) Iceland, (b) the Aleutian islands, (c) 
Turkey, (d) the Andes, (e) the Alps? 


13. Using the data in Fig 5.77, compute the approximate 
spreading rates in the age interval 25-45 Ma at the 
oceanic ridges in the S. Atlantic, S. Indian, N. Pacific 
and S. Pacific oceans. 

14. Three ridges A, B and C meet at a triple junction. 
Ridge A has a strike of 329° (N31W) and a spreading 
rate of 7.0cm yr _1 ; ridge B strikes at 233° (S53W) 
and has a spreading rate of 5.0cm yr -1 . Determine 
the strike of ridge C and its spreading rate. 

15. Three sides of a triangle on the surface of the spheri¬ 
cal Earth measure 900 km, 1350 km, and 1450 km, 
respectively. What are the internal angles of the trian¬ 
gle? If this were a plane triangle, what would the 
internal angles be? 

16. An aircraft leaves a city at latitude and longitude 
<f> x and flies to a second city at latitude A 2 and longi¬ 
tude Derive an expression for the great circle dis¬ 
tance between the two cities. 

17. Apply the above formula to compute the great circle 
distances between the following pairs of cities: 

(a) New York (A =40° 43' N, <j> 1 = 74° T W)- 
Madrid (40° 25' N, 3° 43' W); 

(b) Seattle (A=47° 21' N, ^ = 122° 12' W)- 
Sydney (A = 33° 52' S, 4> x = 151° 13' E); 

(c) Moscow (A =55° 45' N, <^ = 37° 35' E)- 
Paris (A = 48° 52' N, </>, = 2° 20' E); 

(d) London (A = 51° 30' N, r/>, = 0° 10' W)- 
Tokyo (A=35° 42' N, <j> x = 139° 46' E). 

18. Calculate the heading (azimuth) of the aircraft’s 
flight path as it leaves the first city in each pair of 
cities in the previous exercise. 



2 Gravity, the figure of the Earth and geodynamics 


2.1 THE EARTH'S SIZE AND SHAPE 

2.1.1 Earth's size 

The philosophers and savants in ancient civilizations 
could only speculate about the nature and shape of the 
world they lived in. The range of possible travel was 
limited and only simple instruments existed. Unrelated 
observations might have suggested that the Earth's surface 
was upwardly convex. For example, the Sun’s rays con¬ 
tinue to illuminate the sky and mountain peaks after its 
disk has already set, departing ships appear to sink slowly 
over the horizon, and the Earth’s shadow can be seen to be 
curved during partial eclipse of the Moon. However, early 
ideas about the heavens and the Earth were intimately 
bound up with concepts of philosophy, religion and 
astrology. In Greek mythology the Earth was a disk¬ 
shaped region embracing the lands of the Mediterranean 
and surrounded by a circular stream, Oceanus, the origin 
of all the rivers. In the sixth century BC the Greek 
philosopher Anaximander visualized the heavens as a 
celestial sphere that surrounded a flat Earth at its center. 
Pythagoras (582-507 BC) and his followers were appar¬ 
ently the first to speculate that the Earth was a sphere. This 
idea was further propounded by the influential philoso¬ 
pher Aristotle (384-322 BC). Although he taught the sci¬ 
entific principle that theory must follow fact, Aristotle is 
responsible for the logical device called syllogism, which 
can explain correct observations by apparently logical 
accounts that are based on false premises. His influence on 
scientific methodology was finally banished by the scien¬ 
tific revolution in the seventeenth century. 

The first scientifically sound estimate of the size of the 
terrestrial sphere was made by Eratosthenes (275-195 
BC), who was the head librarian at Alexandria, a Greek 
colony in Egypt during the third century BC. Eratosthenes 
had been told that in the city of Syene (modern Aswan) 
the Sun’s noon rays on midsummer day shone vertically 
and were able to illuminate the bottoms of wells, whereas 
on the same day in Alexandria shadows were cast. Using 
a sun-dial Eratosthenes observed that at the summer sol¬ 
stice the Sun's rays made an angle of one-fiftieth of a 
circle (7.2°) with the vertical in Alexandria (Fig. 2.1). 
Eratosthenes believed that Syene and Alexandria were on 
the same meridian. In fact they are slightly displaced; 
their geographic coordinates are 24° 5'N 32° 56'E and 



Fig. 2.1 The method used by Eratosthenes (275-195 BC) to estimate 
the Earth's circumference used the 7.2° difference in altitude of the 
Sun's rays at Alexandria and Syene, which are 5000 stadia apart (after 
Strahler, 1963). 

31° 13'N 29° 55'E, respectively. Syene is actually about 
half a degree north of the tropic of Cancer. Eratosthenes 
knew that the approximate distance from Alexandria to 
Syene was 5000 stadia, possibly estimated by travellers 
from the number of days (“10 camel days”) taken to travel 
between the two cities. From these observations 
Eratosthenes estimated that the circumference of the 
global sphere was 250,000 stadia. The Greek stadium was 
the length (about 185 m) of the U-shaped racecourse on 
which footraces and other athletic events were carried 
out. Eratosthenes’ estimate of the Earth’s circumference 
is equivalent to 46,250 km, about 15% higher than the 
modern value of 40,030 km. 

Estimates of the length of one meridian degree were 
made in the eighth century AD during the Tang dynasty in 
China, and in the ninth century AD by Arab astronomers 
in Mesopotamia. Little progress was made in Europe until 
the early seventeenth century. In 1662 the Royal Society 
was founded in London and in 1666 the Academie Royale 
des Sciences was founded in Paris. Both organizations 
provided support and impetus to the scientific revolution. 
The invention of the telescope enabled more precise geo¬ 
detic surveying. In 1671 a French astronomer, Jean Picard 
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(1620-1682), completed an accurate survey by triangula¬ 
tion of the length of a degree of meridian arc. From his 
results the Earth’s radius was calculated to be 6372 km, 
remarkably close to the modern value of 6371 km. 

2.1.2 Earth's shape 

In 1672 another French astronomer, Jean Richer, was sent 
by Louis XIV to make astronomical observations on the 
equatorial island of Cayenne. Fie found that an accurate 
pendulum clock, which had been adjusted in Paris pre¬ 
cisely to beat seconds, was losing about two and a half 
minutes per day, i.e., its period was now too long. The 
error was much too large to be explained by inaccuracy of 
the precise instrument. The observation aroused much 
interest and speculation, but was only explained some 15 
years later by Sir Isaac Newton in terms of his laws of 
universal gravitation and motion. 

Newton argued that the shape of the rotating Earth 
should be that of an oblate ellipsoid ; compared to a sphere, 
it should be somewhat flattened at the poles and should 
bulge outward around the equator. This inference was 
made on logical grounds. Assume that the Earth does not 
rotate and that holes could be drilled to its center along the 
rotation axis and along an equatorial radius (Fig. 2.2). If 
these holes are filled with water, the hydrostatic pressure at 
the center of the Earth sustains equal water columns along 
each radius. However, the rotation of the Earth causes a 
centrifugal force at the equator but has no effect on the axis 
of rotation. At the equator the outward centrifugal force 
of the rotation opposes the inward gravitational attraction 
and pulls the water column upward. At the same time it 
reduces the hydrostatic pressure produced by the water 
column at the Earth’s center. The reduced central pressure 
is unable to support the height of the water column along 
the polar radius, which subsides. If the Earth were a hydro¬ 
static sphere, the form of the rotating Earth should be an 
oblate ellipsoid of revolution. Newton assumed the Earth’s 
density to be constant and calculated that the flattening 
should be about 1:230 (roughly 0.5%). This is somewhat 
larger than the actual flattening of the Earth, which is 
about 1:298 (roughly 0.3%). 

The increase in period of Richer’s pendulum could 
now be explained. Cayenne was close to the equator, 
where the larger radius placed the observer further from 
the center of gravitational attraction, and the increased 
distance from the rotational axis resulted in a stronger 
opposing centrifugal force. These two effects resulted in a 
lower value of gravity in Cayenne than in Paris, where the 
clock had been calibrated. 

There was no direct proof of Newton’s interpretation. A 
corollary of his interpretation was that the degree of merid¬ 
ian arc should subtend a longer distance in polar regions 
than near the equator (Fig. 2.3). Early in the eighteenth 
century French geodesists extended the standard meridian 
from border to border of the country and found a puzzling 
result. In contrast to the prediction of Newton, the degree 



centrifugal 
force reduces 
1—J> gravity 


Fig. 2.2 Newton's argument that the shape of the rotating Earth should 
be flattened at the poles and bulge at the equator was based on 
hydrostatic equilibrium between polar and equatorial pressure columns 
(after Strahler, 1963). 




Fig. 2.3 (a) The length of a degree of meridian arc is found by 
measuring the distance between two points that lie one degree apart 
on the same meridian, (b) The larger radius of curvature at the flattened 
poles gives a longer arc distance than is found at the equator where the 
radius of curvature is smaller (after Strahler, 1963). 


of meridian arc decreased northward. The French interpre¬ 
tation was that the Earth’s shape was a prolate ellipsoid, 
elongated at the poles and narrowed at the equator, like the 
shape of a rugby football. A major scientific controversy 
arose between the "flatteners” and the “elongators.” 
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To determine whether the Earth’s shape was oblate or 
prolate, the Academie Royale des Sciences sponsored two 
scientific expeditions. In 1736-1737 a team of scientists 
measured the length of a degree of meridian arc in 
Lapland, near the Arctic Circle. They found a length 
appreciably longer than the meridian degree measured by 
Picard near Paris. From 1735 to 1743 a second party of 
scientists measured the length of more than 3 degrees of 
meridian arc in Peru, near the equator. Their results 
showed that the equatorial degree of latitude was shorter 
than the meridian degree in Paris. Both parties confirmed 
convincingly the prediction of Newton that the Earth’s 
shape is that of an oblate ellipsoid. 

The ellipsoidal shape of the Earth resulting from its 
rotation has important consequences, not only for the 
variation with latitude of gravity on the Earth’s surface, 
but also for the Earth’s rate of rotation and the orienta¬ 
tion of its rotational axis. These are modified by torques 
that arise from the gravitational attractions of the Sun, 
Moon and planets on the ellipsoidal shape. 

2.2 GRAVITATION 

2.2.1 The law of universal gravitation 

Sir Isaac Newton (1642-1727) was born in the same year 
in which Galileo died. Unlike Galileo, who relished 
debate, Newton was a retiring person and avoided con¬ 
frontation. His modesty is apparent in a letter written in 
1675 to his colleague Robert Hooke, famous for his exper¬ 
iments on elasticity. In this letter Newton made the 
famous disclaimer “if I have seen further (than you and 
Descartes) it is by standing upon the shoulders of 
Giants.” In modern terms Newton would be regarded as a 
theoretical physicist. He had an outstanding ability to 
synthesize experimental results and incorporate them 
into his own theories. Faced with the need for a more 
powerful technique of mathematical analysis than existed 
at the time, he invented differential and integral calculus, 
for which he is credited equally with Gottfried Wilhelm 
von Leibnitz (1646-1716) who discovered the same 
method independently. Newton was able to resolve many 
issues by formulating logical thought experiments; an 
example is his prediction that the shape of the Earth is an 
oblate ellipsoid. He was one of the most outstanding syn¬ 
thesizers of observations in scientific history, which is 
implicit in his letter to Hooke. His three-volume book 
Philosophiae Naturalis Principia Mathematica, published 
in 1687, ranks as the greatest of all scientific texts. The 
first volume of the Principia contains Newton’s famous 
Laws of Motion, the third volume handles the Law of 
Universal Gravitation. 

The first two laws of motion are generalizations from 
Galileo’s results. As a corollary Newton applied his laws 
of motion to demonstrate that forces must be added as 
vectors and showed how to do this geometrically with a 
parallelogram. The second law of motion states that the 


rate of change of momentum of a mass is proportional 
to the force acting upon it and takes place in the direction 
of the force. For the case of constant mass, this law serves 
as the definition of force (F) in terms of the acceleration 
(a) given to a mass (m): 

F = /«a (2.1) 

The unit of force in the SI system of units is the 
newton (N). It is defined as the force that gives a mass of 
one kilogram (1 kg) an acceleration of 1 m s -2 . 

His celebrated observation of a falling apple may be a 
legend, but Newton’s genius lay in recognizing that the 
type of gravitational field that caused the apple to fall was 
the same type that served to hold the Moon in its orbit 
around the Earth, the planets in their orbits around the 
Sun, and that acted between minute particles characterized 
only by their masses. Newton used Kepler’s empirical third 
law (see Section 1.1.2 and Eq. (1.2)) to deduce that the 
force of attraction between a planet and the Sun varied 
with the “quantities of solid matter that they contain” (i.e., 
their masses) and with the inverse square of the distance 
between them. Applying this law to two particles or point 
masses m and M separated by a distance r (Fig. 2.4a), we 
get for the gravitational attraction F exerted by M on m 


In this equation r is a unit vector in the direction of 
increase in coordinate r, which is directed away from the 
center of reference at the mass M. The negative sign in the 
equation indicates that the force F acts in the opposite 
direction, toward the attracting mass M. The constant G, 
which converts the physical law to an equation, is the con¬ 
stant of universal gravitation. 

There was no way to determine the gravitational con¬ 
stant experimentally during Newton’s lifetime. The 
method to be followed was evident, namely to determine 
the force between two masses in a laboratory experiment. 
However, seventeenth century technology was not yet up 
to this task. Experimental determination of G was 
extremely difficult, and was first achieved more than a 
century after the publication of Principia by Lord 
Charles Cavendish (1731-1810). From a set of painstak¬ 
ing measurements of the force of attraction between two 
spheres of lead, Cavendish in 1798 determined the value 
of G to be 6.754 X 10 -11 m 3 kg -1 s -2 . A modern value 
(Mohr and Taylor, 2005) is 6.674 210 X 10 -11 m 3 kg -1 s -2 . 
It has not yet been possible to determine Gmore precisely, 
due to experimental difficulty. Although other physical 
constants are now known with a relative standard uncer¬ 
tainty of much less than 1 X 10 -6 , the gravitational con¬ 
stant is known to only 150 X 10 -6 . 

2.2.1.1 Potential energy and work 

The law of conservation of energy means that the total 
energy of a closed system is constant. Two forms of 
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(a) point masses 

M F m 

• . >r 

k. r .>| 


(b) point mass and sphere 



(c) point mass on Earth's surface 



Fig. 2.4 Geometries for the gravitational attraction on (a) two point 
masses, (b) a point mass outside a sphere, and (c) a point mass on the 
surface of a sphere. 

energy need be considered here. The first is the potential 
energy, which an object has by virtue of its position rela¬ 
tive to the origin of a force. The second is the work 
done against the action of the force during a change in 
position. 

For example, when Newton’s apple is on the tree it has 
a higher potential energy than when it lies on the ground. 
It falls because of the downward force of gravity and loses 
potential energy in doing so. To compute the change in 
potential energy we need to raise the apple to its original 
position. This requires that we apply a force equal and 
opposite to the gravitational attraction on the apple and, 
because this force must be moved through the distance the 
apple fell, we have to expend energy in the form of work. If 
the original height of the apple above ground level was h 
and the value of the force exerted by gravity on the apple is 
F, the force we must apply to put it back is (-F). 
Assuming that F is constant through the short distance of 
its fall, the work expended is (— F)h. This is the increase in 
potential energy of the apple, when it is on the tree. 

More generally, if the constant force F moves through 
a small distance dr in the same direction as the force, the 
work done is d W= Fdr and the change in potential 
energy d E p is given by 

d£p= -dW= -Fdr (2.3) 

In the more general case we have to consider motions 
and forces that have components along three orthogonal 
axes. The displacement dr and the force F no longer need 
to be parallel to each other. We have to treat F and dr as 
vectors. In Cartesian coordinates the displacement vector 
dr has components (dx, Ay, dz) and the force has compo¬ 
nents (F x , F y , F z ) along each of the respective axes. The 


work done by the ^-component of the force when it is dis¬ 
placed along the x-axis is F x dx, and there are similar 
expressions for the displacements along the other axes. 
The change in potential energy d£p is now given by 

d£p = - d W = — (F x dx + F y dy + FA:) (2.4) 

The expression in brackets is called the scalar product 
of the vectors Tand dr. It is equal to Fdr cost), where 6 is 
the angle between the vectors. 

2.2.2 Gravitational acceleration 

In physics the field of a force is often more important than 
the absolute magnitude of the force. The field is defined as 
the force exerted on a material unit. For example, the elec¬ 
trical field of a charged body at a certain position is the 
force it exerts on a unit of electrical charge at that loca¬ 
tion. The gravitationalfield in the vicinity of an attracting 
mass is the force it exerts on a unit mass. Equation (2.1) 
shows that this is equivalent to the acceleration vector. 

In geophysical applications we are concerned with 
accelerations rather than forces. By comparing Eq. (2.1) 
and Eq. (2.2) we get the gravitational acceleration a G of 
the mass m due to the attraction of the mass M: 

a G=- G 72 r < 2 - 5 ) 

The SI unit of acceleration is the m s -2 ; this unit is 
unpractical for use in geophysics. In the now superseded 
c.g.s. system the unit of acceleration was the cm s -2 , 
which is called a gal in recognition of the contributions of 
Galileo. The small changes in the acceleration of gravity 
caused by geological structures are measured in thou¬ 
sandths of this unit, i.e., in milligal ( mgal ). Until recently, 
gravity anomalies due to geological structures were sur¬ 
veyed with field instruments accurate to about one-tenth 
of a milligal, which was called a gravity unit. Modern 
instruments are capable of measuring gravity differences 
to a millionth of a gal, or microgal (yrgal), which is 
becoming the practical unit of gravity investigations. The 
value of gravity at the Earth's surface is about 9.8 m s -2 , 
and so the sensitivity of modern measurements of gravity 
is about 1 part in 10 9 . 

22.2.1 Gravitational potential 

The gravitational potential is the potential energy of a 
unit mass in a field of gravitational attraction. Let the 
potential be denoted by the symbol U G . The potential 
energy E of a mass m in a gravitational field is thus equal 
to ( mU G ). Thus, a change in potential energy (dis p ) is 
equal to (m dt/ G ). Equation (2.3) becomes, using Eq. (2.1), 

m dU c = Fdr = —ma Q dr (2.6) 

Rearranging this equation we get the gravitational accel¬ 
eration 
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a G 



(2.7) 


In general, the acceleration is a three-dimensional 
vector. If we are using Cartesian coordinates (x, y, z), the 
acceleration will have components (a x , a y , af). These may 
be computed by calculating separately the derivatives of 
the potential with respect to x, y and z: 


auc 

su G 

dU G 

dx 

a y dy 



( 2 . 8 ) 


Equating Eqs. (2.3) and (2.7) gives the gravitational 
potential of a point mass M: 


dU G= M 

dr ° r 2 


(2.9) 


the solution of which is 


U G = 



( 2 . 10 ) 




Fig. 2.5 (a) Each small particle of a solid body exerts a gravitational 
attraction in a different direction at an external point P. (b) Computation 
of the gravitational potential of a continuous mass distribution. 


22.2.2 Acceleration and potential of a distribution of mass 

Until now, we have considered only the gravitational 
acceleration and potential of point masses. A solid body 
may be considered to be composed of numerous small 
particles, each of which exerts a gravitational attraction at 
an external point P (Fig. 2.5a). To calculate the gravita¬ 
tional acceleration of the object at the point P we must 
form a vector sum of the contributions of the individual 
discrete particles. Each contribution has a different direc¬ 
tion. Assuming m j to be the mass of the particle at dis¬ 
tance r. from P, this gives an expression like 

m, m, m, 

a G= (2U) 

r l '2 '3 

Depending on the shape of the solid, this vector sum can 
be quite complicated. 

An alternative solution to the problem is found by first 
calculating the gravitational potential, and then differen¬ 
tiating it as in Eq. (2.5) to get the acceleration. The 
expression for the potential at P is 



This is a scalar sum, which is usually more simple to cal¬ 
culate than a vector sum. 

More commonly, the object is not represented as an 
assemblage of discrete particles but by a continuous mass 
distribution. However, we can subdivide the volume into 
discrete elements; if the density of the matter in each 
volume is known, the mass of the small element can be 
calculated and its contribution to the potential at the 
external point P can be determined. By integrating over 
the volume of the body its gravitational potential at P can 
be calculated. At a point in the body with coordinates (x, 
y, z) let the density be p(x, y, z) and let its distance from P 


be r(x, y, z) as in Fig. 2.5b. The gravitational potential of 
the body at P is 

U o=~ G { {{^zj dxd ^ dz (2A3) 

x y z 

The integration readily gives the gravitational potential 
and acceleration at points inside and outside a hollow or 
homogeneous solid sphere. The values outside a sphere at 
distance r from its center are the same as if the entire mass 
E of the sphere were concentrated at its center (Fig. 2.4b): 


li 

i 

Cl 

~it»] 

(2.14) 

- r E r 

— — Cr^r 

r l 

(2.15) 


2.2.23 Mass and mean density of the Earth 

Equations (2.14) and (2.15) are valid everywhere outside 
a sphere, including on its surface where the distance from 
the center of mass is equal to the mean radius R (Fig. 
2.4c). If we regard the Earth to a first approximation as a 
sphere with mass E and radius R , we can estimate the 
Earth’s mass by rewriting Eq. (2.15) as a scalar equation 
in the form 



The gravitational acceleration at the surface of the 
Earth is only slightly different from mean gravity, about 
9.81 m s -2 , the Earth’s radius is 6371 km, and the gravita¬ 
tional constant is 6.674 X 10 -11 m 3 kg -1 s -2 . The mass of 
the Earth is found to be 5.974 X 10 24 kg. This large number 
is not so meaningful as the mean density of the Earth, 
which may be calculated by dividing the Earth’s mass by its 
volume (yTri? 3 ). A mean density of 5515 kg m -3 is obtained, 
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which is about double the density of crustal rocks. This 
indicates that the Earth’s interior is not homogeneous, 
and implies that density must increase with depth in the 
Earth. 

2.2.3 The equipotential surface 

An equipotential surface is one on which the potential is 
constant. For a sphere of given mass the gravitational 
potential (Eq. (2.15)) varies only with the distance r from 
its center. A certain value of the potential, say U v is real¬ 
ized at a constant radial distance r v Thus, the equipo¬ 
tential surface on which the potential has the value U x is 
a sphere with radius r x , a different equipotential surface 
U 2 is the sphere with radius r 2 . The equipotential sur¬ 
faces of the original spherical mass form a set of concen¬ 
tric spheres (Fig. 2.6a), one of which (e.g., U 0 ) coincides 
with the surface of the spherical mass. This particular 
equipotential surface describes the figure of the spheri¬ 
cal mass. 

By definition, no change in potential takes place (and 
no work is done) in moving from one point to another on 
an equipotential surface. The work done by a force Fin a 
displacement dr is Fdrcosd which is zero when cosd is 
zero, that is, when the angle 6 between the displacement 
and the force is 90°. If no work is done in a motion along 
a gravitational equipotential surface, the force and accel¬ 
eration of the gravitational field must act perpendicular 
to the surface. This normal to the equipotential surface 
defines the vertical , or plumb-line, direction (Fig. 2.6b). 
The plane tangential to the equipotential surface at a 
point defines the horizontal at that point. 

2.3 THE EARTH'S ROTATION 

2.3.1 Introduction 

The rotation of the Earth is a vector, i.e., a quantity char¬ 
acterized by both magnitude and direction. The Earth 
behaves as an elastic body and deforms in response to the 
forces generated by its rotation, becoming slightly flat¬ 
tened at the poles with a compensating bulge at the 
equator. The gravitational attractions of the Sun, Moon 
and planets on the spinning, flattened Earth cause 
changes in its rate of rotation, in the orientation of the 
rotation axis, and in the shape of the Earth’s orbit around 
the Sun. Even without extra-terrestrial influences the 
Earth reacts to tiny displacements of the rotation axis 
from its average position by acquiring a small, unsteady 
wobble. These perturbations reflect a balance between 
gravitation and the forces that originate in the Earth’s 
rotational dynamics. 

2.3.2 Centripetal and centrifugal acceleration 

Newton’s first law of motion states that every object con¬ 
tinues in its state of rest or of uniform motion in a 




Fig. 2.6 (a) Equipotential surfaces of a spherical mass form a set of 
concentric spheres, (b) The normal to the equipotential surface defines 
the vertical direction; the tangential plane defines the horizontal. 

straight line unless compelled to change that state by 
forces acting on it. The continuation of a state of motion 
is by virtue of the inertia of the body. A framework in 
which this law is valid is called an inertial system. For 
example, when we are travelling in a car at constant speed, 
we feel no disturbing forces; reference axes fixed to the 
moving vehicle form an inertial frame. If traffic condi¬ 
tions compel the driver to apply the brakes, we experience 
decelerating forces; if the car goes around a corner, even 
at constant speed, we sense sideways forces toward the 
outside of the corner. In these situations the moving car is 
being forced to change its state of uniform rectilinear 
motion and reference axes fixed to the car form a non- 
inert ial system. 

Motion in a circle implies that a force is active that 
continually changes the state of rectilinear motion. 
Newton recognized that the force was directed inwards, 
towards the center of the circle, and named it the cen¬ 
tripetal (meaning “center-seeking”) force. He cited the 
example of a stone being whirled about in a sling. The 
inward centripetal force exerted on the stone by the sling 
holds it in a circular path. If the sling is released, the 
restraint of the centripetal force is removed and the 
inertia of the stone causes it to continue its motion at 
the point of release. No longer under the influence of the 
restraining force, the stone flies off in a straight line. 
Arguing that the curved path of a projectile near the 
surface of the Earth was due to the effect of gravity, 
which caused it constantly to fall toward the Earth, 
Newton postulated that, if the speed of the projectile 
were exactly right, it might never quite reach the Earth’s 
surface. If the projectile fell toward the center of the 
Earth at the same rate as the curved surface of the Earth 
fell away from it, the projectile would go into orbit around 
the Earth. Newton suggested that the Moon was held in 
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orbit around the Earth by just such a centripetal force, 
which originated in the gravitational attraction of the 
Earth. Likewise, he visualized that a centripetal force due 
to gravitational attraction restrained the planets in their 
circular orbits about the Sun. 

The passenger in a car going round a corner experi¬ 
ences a tendency to be flung outwards. Ele is restrained 
in position by the frame of the vehicle, which supplies 
the necessary centripetal acceleration to enable the pas¬ 
senger to go round the curve in the car. The inertia of the 
passenger's body causes it to continue in a straight line 
and pushes him outwards against the side of the vehicle. 
This outward force is called the centrifugal force. It arises 
because the car does not represent an inertial reference 
frame. An observer outside the car in a fixed (inertial) 
coordinate system would note that the car and passenger 
are constantly changing direction as they round the 
corner. The centrifugal force feels real enough to the 
passenger in the car, but it is called a pseudo-force, or 
inertial force. In contrast to the centripetal force, which 
arises from the gravitational attraction, the centrifugal 
force does not have a physical origin, but exists only 
because it is being observed in a non-inertial reference 
frame. 

2.3.2.1 Centripetal acceleration 

The mathematical form of the centripetal acceleration 
for circular motion with constant angular velocity co 
about a point can be derived as follows. Define orthogo¬ 
nal Cartesian axes x and y relative to the center of the 
circle as in Fig. 2.7a. The linear velocity v at any point 
where the radius vector makes an angle 9 = (cot) with the 
x-axis has components 

v = — vsin(a)t) = —rco sin(wt) 

(2.17) 

v y = vcos(wf) = rcocos(cot) 

The x- and ^-components of the acceleration are 
obtained by differentiating the velocity components with 
respect to time. This gives 

a = — (ovcos((ot) = — rco 2 cos(cot) 

(2-18) 

a y = —wvstn(wt) = — nw-sm(tot) 

These are the components of the centripetal accelera¬ 
tion, which is directed radially inwards and has the mag¬ 
nitude co 2 r (Fig. 2.7b). 

23.2.2 Centrifugal acceleration and potential 

In handling the variation of gravity on the Earth's surface 
we must operate in a non-inertial reference frame attached 
to the rotating Earth. Viewed from a fixed, external inertial 
frame, a stationary mass moves in a circle about the Earth’s 
rotation axis with the same rotational speed as the Earth. 



Fig. 2.7 (a) Components v x and v y of the linear velocity v where the 
radius makes an angle 6 ( o>t ) with the x-axis, and (b) the components a x 
and a y of the centripetal acceleration, which is directed radially inward. 


However, within a rotating reference frame attached to the 
Earth, the mass is stationary. It experiences a centrifugal 
acceleration ( a c ) that is exactly equal and opposite to the 
centripetal acceleration, and which can be written in the 
alternative forms 


a c = co 2 r 



The centrifugal acceleration is not a centrally oriented 
acceleration like gravitation, but instead is defined relative 
to an axis of rotation. Nevertheless, potential energy is 
associated with the rotation and it is possible to define a 
centrifugal potential. Consider a point rotating with the 
Earth at a distance r from its center (Fig. 2.8). The angle 6 
between the radius to the point and the axis of rotation is 
called the colatitude; it is the angular complement of the 
latitude A. The distance of the point from the rotational 
axis is x (= r sin0), and the centrifugal acceleration is ofx 
outwards in the direction of increasing x. The centrifugal 
potential U c is defined such that 

dU 

a c = - -fa x= (w-x)x (2.20) 

where x is the outward unit vector. On integrating, we 
obtain 

U c = — jw 2 x 2 = —jco 2 r 2 cos 2 A = ~^co 2 r 2 sin 2 0 (2.21) 
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Fig. 2.8 The outwardly directed centrifugal acceleration a c at latitude A 
on a sphere rotating at angular velocity &>. 


23.2.3 Kepler's third law of planetary motion 

By comparing the centripetal acceleration of a planet 
about the Sun with the gravitational acceleration of the 
Sun, the third of Kepler's laws of planetary motion can 
be explained. Let S be the mass of the Sun, r p the distance 
of a planet from the Sun, and T p the period of orbital 
rotation of the planet around the Sun. Equating the grav¬ 
itational and centripetal accelerations gives 



( 2 . 22 ) 


Rearranging this equation we get Kepler’s third law of 
planetary motion, which states that the square of the 
period of the planet is proportional to the cube of the 
radius of its orbit, or: 

r 3 

Y 2 ~ ^2 = constant (2.23) 


23.2.4 Verification of the inverse square law of gravitation 

Newton realized that the centripetal acceleration of the 
Moon in its orbit was supplied by the gravitational attrac¬ 
tion of the Earth, and tried to use this knowledge to 
confirm the inverse square dependence on distance in his 
law of gravitation. The sidereal period ( T L ) of the Moon 
about the Earth, a sidereal month, is equal to 27.3 days. 
Let the corresponding angular rate of rotation be o> L . We 
can equate the gravitational acceleration of the Earth at 
the Moon with the centripetal acceleration due to « L : 



(2.24) 


This equation can be rearranged as follows 




(2.25) 


Comparison with Eq. (2.15) shows that the first quan¬ 
tity in parentheses is the mean gravitational acceleration 
on the Earth’s surface, a Q . Therefore, we can write 



(2.26) 


In Newton’s time little was known about the physical 
dimensions of our planet. The distance of the Moon was 
known to be approximately 60 times the radius of the 
Earth (see Section 1.1.3.2) and its sidereal period was 
known to be 27.3 days. At first Newton used the accepted 
value 5500 km for the Earth’s radius. This gave a value of 
only 8.4 m s -2 for gravity, well below the known value 
of 9.8 m s -2 . However, in 1671 Picard determined the 
Earth’s radius to be 6372 km. With this value, the inverse 
square character of Newton’s law of gravitation was 
confirmed. 


2.3.3 The tides 

The gravitational forces of Sun and Moon deform the 
Earth’s shape, causing tides in the oceans, atmosphere 
and solid body of the Earth. The most visible tidal effects 
are the displacements of the ocean surface, which is a 
hydrostatic equipotential surface. The Earth does not 
react rigidly to the tidal forces. The solid body of the 
Earth deforms in a like manner to the free surface, giving 
rise to so-called bodily Earth-tides. These can be observed 
with specially designed instruments, which operate on a 
similar principle to the long-period seismometer. 

The height of the marine equilibrium tide amounts to 
only half a meter or so over the free ocean. In coastal 
areas the tidal height is significantly increased by the 
shallowing of the continental shelf and the confining 
shapes of bays and harbors. Accordingly, the height and 
variation of the tide at any place is influenced strongly 
by complex local factors. Subsequent subsections deal 
with the tidal deformations of the Earth’s hydrostatic 
figure. 

2.3.3.1 Lunar tidal periodicity 

The Earth and Moon are coupled together by gravitational 
attraction. Their common motion is like that of a pair of 
ballroom dancers. Each partner moves around the center 
of mass (or barycenter) of the pair. For the Earth-Moon 
pair the location of the center of mass is easily found. Let 
E be the mass of the Earth, and m that of the Moon; let the 
separation of the centers of the Earth and Moon be r L and 
let the distance of their common center of mass be d from 
the center of the Earth. The moment of the Earth about 
the center of mass is Ed and the moment of the Moon is 
m(r L - d). Setting these moments equal we get 

, m 
d = — r T 

h + m L 


(2.27) 
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Sun. 


The mass of the Moon is 0.0123 that of the Earth and 
the distance between the centers is 384,100 km. These 
figures give rf=4600km, i.e., the center of revolution of 
the Earth-Moon pair lies within the Earth. 

It follows that the paths of the Earth and the Moon 
around the Sun are more complicated than at first 
appears. The elliptical orbit is traced out by the barycen- 
ter of the pair (Fig. 2.9). The Earth and Moon follow 
wobbly paths, which, while always concave towards the 
Sun, bring each body at different times of the month 
alternately inside and outside the elliptical orbit. 

To understand the common revolution of the 
Earth-Moon pair we have to exclude the rotation of the 
Earth about its axis. The “revolution without rotation” is 
illustrated in Fig. 2.10. The Earth-Moon pair revolves 
about S, the center of mass. Let the starting positions be as 
shown in Fig. 2.10a. Approximately one week later the 
Moon has advanced in its path by one-quarter of a revolu¬ 
tion and the center of the Earth has moved so as to keep 
the center of mass fixed (Fig. 2.10b). The relationship is 



Fig. 2.10 Illustration of the "revolution without rotation" of the 
Earth-Moon pair about their common center of mass at S. 


maintained in the following weeks (Fig. 2.10c, d) so that 
during one month the center of the Earth describes a circle 
about S. Now consider the motion of point number 2 on 
the left-hand side of the Earth in Fig. 2.10. If the Earth 
revolves as a rigid body and the rotation about its own axis 
is omitted, after one week point 2 will have moved to a new 
position but will still be the furthest point on the left. 
Subsequently, during one month point 2 will describe a 
small circle with the same radius as the circle described by 
the Earth’s center. Similarly points 1, 3 and 4 will also 
describe circles of exactly the same size. A simple illustra¬ 
tion of this point can be made by chalking the tip of each 
finger on one hand with a different color, then moving your 
hand in a circular motion while touching a blackboard; 
your fingers will draw a set of identical circles. 

The “revolution without rotation” causes each point in 
the body of the Earth to describe a circular path with iden¬ 
tical radius. The centrifugal acceleration of this motion 
has therefore the same magnitude at all points in the Earth 
and, as can be seen by inspection of Fig. 2.10(a-d), it is 
directed away from the Moon parallel to the Earth-Moon 
line of centers. At C, the center of the Earth (Fig. 2.11a), 
this centrifugal acceleration exactly balances the gravita¬ 
tional attraction of the Moon. Its magnitude is given by 

a L = (2.28) 

r L 

At B, on the side of the Earth nearest to the Moon, the 
gravitational acceleration of the Moon is larger than at 
the center of the Earth and exceeds the centrifugal accel¬ 
eration a L . There is a residual acceleration toward the 
Moon, which raises a tide on this side of the Earth. The 
magnitude of the tidal acceleration at B is 
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a L -<- constant centrifugal acceleration 

CL G ->- variable lunar gravitation 

Cl T —£> residual tidal acceleration 



Fig. 2.11 (a) The relationships of the centrifugal, gravitational and 
residual tidal accelerations at selected points in the Earth, (b) Latitude 
effect that causes diurnal inequality of the tidal height. 


fl T =GW f ( , L -/ ? )2-4) (2 ' 29) 

«x=Gj|((l-f) '-l) (2.30) 

Expanding this equation with the binomial theorem and 
simplifying gives 

fl T =G !( 2 i| + 3 ((?) + ■ ) (231) 


At points D and D' the direction of the gravitational 
acceleration due to the Moon is not exactly parallel to the 
line of centers of the Earth-Moon pair. The residual tidal 
acceleration is almost along the direction toward the 
center of the Earth. Its effect is to lower the free surface in 
this direction. 

The free hydrostatic surface of the Earth is an equipo- 
tential surface (Section 2.2.3), which in the absence of the 
Earth's rotation and tidal effects would be a sphere. The 
lunar tidal accelerations perturb the equipotential 
surface, raising it at A and B while lowering it at D and 
D', as in Fig. 2.11a. The tidal deformation of the Earth 
produced by the Moon thus has an almost prolate 
ellipsoidal shape, like a rugby football, along the 
Earth-Moon line of centers. The daily tides are caused 
by superposing the Earth’s rotation on this deformation. 
In the course of one day a point rotates past the points A, 
D, B and D' and an observer experiences two full tidal 
cycles, called the semi-diurnal tides. The extreme tides are 
not equal at every latitude, because of the varying angle 
between the Earth’s rotational axis and the Moon’s orbit 
(Fig. 2.11b). At the equator E the semi-diurnal tides are 
equal; at an intermediate latitude F one tide is higher 
than the other; and at latitude G and higher there is only 
one (diurnal) tide per day. The difference in height 
between two successive high or low tides is called the 
diurnal inequality. 

In the same way that the Moon deforms the Earth, so 
the Earth causes a tidal deformation of the Moon. In 
fact, the tidal relationship between any planet and one of 
its moons, or between the Sun and a planet or comet, can 
be treated analogously to the Earth-Moon pair. A tidal 
acceleration similar to Eq. (2.31) deforms the smaller 
body; its self-gravitation acts to counteract the deforma¬ 
tion. However, if a moon or comet comes too close to the 
planet, the tidal forces deforming it may overwhelm the 
gravitational forces holding it together, so that the moon 
or comet is torn apart. The separation at which this 
occurs is called the Roche limit (Box 2.1). The material of 
a disintegrated moon or comet enters orbit around the 
planet, forming a system of concentric rings, as around 
the great planets (Section 1.1.3.3). 


At A, on the far side of the Earth, the gravitational 
acceleration of the Moon is less than the centrifugal 
acceleration a L . The residual acceleration (Fig. 2.11a) is 
away from the Moon, and raises a tide on the far side of 
the Earth. The magnitude of the tidal acceleration at A is 


which reduces to 
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(2.33) 


23.3.2 Tidal effect of the Sun 

The Sun also has an influence on the tides. The theory of 
the solar tides can be followed in identical manner to the 
lunar tides by again applying the principle of "revolution 
without rotation.’’ The Sun’s mass is 333,000 times greater 
than that of the Earth, so the common center of mass is 
close to the center of the Sun at a radial distance of about 
450 km from its center. The period of the revolution is one 
year. As for the lunar tide, the imbalance between gravita¬ 
tional acceleration of the Sun and centrifugal acceleration 
due to the common revolution leads to a prolate ellip¬ 
soidal tidal deformation. The solar effect is smaller than 
that of the Moon. Although the mass of the Sun is vastly 
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(1) conjunction 
(new moon) 
to the 
Sun 
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(3) opposition 
(full moon) 

Fig. 2.12 The orientations of the solar and lunar tidal deformations of 
the Earth at different lunar phases. 


inclined at a very small angle of about 5° to it. For discus¬ 
sion of the combination of lunar and solar tides we can 
assume the orbits to be coplanar. The Moon and Sun 
each produce a prolate tidal deformation of the Earth, 
but the relative orientations of these ellipsoids vary 
during one month (Fig. 2.12). At conjunction the (new) 
Moon is on the same side of the Earth as the Sun, and the 
ellipsoidal deformations augment each other. The same is 
the case half a month later at opposition, when the (full) 
Moon is on the opposite side of the Earth from the Sun. 
The unusually high tides at opposition and conjunction 
are called spring tides. In contrast, at the times of quadra¬ 
ture the waxing or waning half Moon causes a prolate 
ellipsoidal deformation out of phase with the solar defor¬ 
mation. The maximum lunar tide coincides with the 
minimum solar tide, and the effects partially cancel each 
other. The unusually low tides at quadrature are called 
neap tides. The superposition of the lunar and solar tides 
causes modulation of the tidal amplitude during a month 
(Fig. 2.13). 

2.3.3 A Effect of the tides on gravity measurements 

The tides have an effect on gravity measurements made 
on the Earth. The combined effects of Sun and Moon 
cause an acceleration at the Earth’s surface of approxi¬ 
mately 0.3mgal, of which about two-thirds are due to the 
Moon and one-third to the Sun. The sensitive modern 
instruments used for gravity exploration can readily detect 
gravity differences of 0.01 mgal. It is necessary to compen¬ 


sate gravity measurements for the tidal effects, which vary 
with location, date and time of day. Fortunately, tidal 
theory is so well established that the gravity effect can be 
calculated and tabulated for any place and time before 
beginning a survey. 

23.3.5 Bodily Earth-tides 

A simple way to measure the height of the marine tide 
might be to fix a stake to the sea-bottom at a suitably shel¬ 
tered location and to record continuously the measured 
water level (assuming that confusion introduced by wave 
motion can be eliminated or taken into account). The 
observed amplitude of the marine tide, defined by the dis¬ 
placement of the free water surface, is found to be about 
70% of the theoretical value. The difference is explained 
by the elasticity of the Earth. The tidal deformation cor¬ 
responds to a redistribution of mass, which modifies the 
gravitational potential of the Earth and augments the ele¬ 
vation of the free surface. This is partially counteracted 
by a bodily tide in the solid Earth, which deforms elasti¬ 
cally in response to the attraction of the Sun and Moon. 
The free water surface is raised by the tidal attraction, but 
the sea-bottom in which the measuring rod is implanted is 
also raised. The measured tide is the difference between 
the marine tide and the bodily Earth-tide. 

In practice, the displacement of the equipotential 
surface is measured with a horizontal pendulum, which 
reacts to the tilt of the surface. The bodily Earth-tides 
also affect gravity measurements and can be observed 
with sensitive gravimeters. The effects of the bodily 
Earth-tides are incorporated into the predicted tidal cor¬ 
rections to gravity measurements. 

2.3.4 Changes in Earth's rotation 

The Earth’s rotational vector is affected by the gravita¬ 
tional attractions of the Sun, Moon and the planets. The 
rate of rotation and the orientation of the rotational axis 
change with time. The orbital motion around the Sun is 
also affected. The orbit rotates about the pole to the plane 
of the ecliptic and its ellipticity changes over long periods 
of time. 

2.3.4.1 Effect of lunar tidal friction on the length of the day 

If the Earth reacted perfectly elastically to the lunar tidal 
forces, the prolate tidal bulge would be aligned along the 
line of centers of the Earth-Moon pair (Fig. 2.14a). 
However, the motion of the seas is not instantaneous and 
the tidal response of the solid part of the Earth is partly 
anelastic. These features cause a slight delay in the time 
when high tide is reached, amounting to about 12 
minutes. In this short interval the Earth’s rotation carries 
the line of the maximum tides past the line of centers by a 
small angle of approximately 2.9° (Fig. 2.14b). A point on 
the rotating Earth passes under the line of maximum 
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Fig. 2.13 Schematic 
representation of the 
modulation of the tidal 
amplitude as a result of 
superposition of the lunar and 
solar tides. 




Fig. 2.14 (a) Alignment of the prolate tidal bulge of a perfectly elastic 
Earth along the line of centers of the Earth-Moon pair, (b) Tidal phase 
lag of 2.9° relative to the line of centers due to the Earth's partially 
anelastic response, (c) Tidal decelerating torque due to unequal 
gravitational attractions of the Moon on the far- and near-sided 
tidal bulges. 


tides 12 minutes after it passes under the Moon. The 
small phase difference is called the tidal lag. 

Because of the tidal lag the gravitational attraction of 
the Moon on the tidal bulges on the far side and near side 
of the Earth (Fj and F 2 , respectively) are not collinear 
(Fig. 2.14b). F 2 is stronger than Fj so a torque is produced 
in the opposite sense to the Earth’s rotation (Fig. 2.14c). 
The tidal torque acts as a brake on the Earth’s rate of 
rotation, which is gradually slowing down. 

The tidal deceleration of the Earth is manifested in a 
gradual increase in the length of the day. The elfect is very 
small. Tidal theory predicts an increase in the length of 
the day of only 2.4 milliseconds per century. Observations 
of the phenomenon are based on ancient historical 
records of lunar and solar eclipses and on telescopically 
observed occultations of stars by the Moon. The current 
rate of rotation of the Earth can be measured with very 
accurate atomic clocks. Telescopic observations of the 
daily times of passage of stars past the local zenith are 
recorded with a camera controlled by an atomic clock. 
These observations give precise measures of the mean 
value and fluctuations of the length of the day. 

The occurrence of a lunar or solar eclipse was a 
momentous event for ancient peoples, and was duly 
recorded in scientific and non-scientific chronicles. 
Untimed observations are found in non-astronomical 
works. They record, with variable reliability, the degree of 
totality and the time and place of observation. The 
unaided human eye is able to decide quite precisely just 
when an eclipse becomes total. Timed observations of 
both lunar and solar eclipses made by Arab astronomers 
around 800-1000 AD and Babylonian astronomers a 
thousand years earlier give two important groups of data 
(Fig. 2.15). By comparing the observed times of alignment 
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Fig. 2.15 Long-term changes in the length of the day deduced from 
observations of solar and lunar eclipses between 700 BC and 1980 AD 
(after Stephenson and Morrison, 1984). 


of Sun, Moon and Earth with times predicted from the 
theory of celestial mechanics, the differences due to 
change in length of the day may be computed. A straight 
line with slope equal to the rate of increase of the length of 
the day inferred from tidal theory, 2.4 ms per century, con¬ 
nects the Babylonian and Arab data sets. Since the 
medieval observations of Arab astronomers the length of 
the day has increased on average by about 1.4 ms 
per century. The data set based on telescopic observations 
covers the time from 1620 to 1980 AD. It gives a more 
detailed picture and shows that the length of the day fluc¬ 
tuates about the long-term trend of 1.4 ms per century. A 
possible interpretation of the difference between the two 
slopes is that non-tidal causes have opposed the decelera¬ 
tion of the Earth’s rotation since about 950 AD. It would 
be wrong to infer that some sudden event at that epoch 
caused an abrupt change, because the data are equally 
compatible with a smoothly changing polynomial. The 
observations confirm the importance of tidal braking, but 
they also indicate that tidal friction is not the only mecha¬ 
nism affecting the Earth's rotation. 

The short-term fluctuations in rotation rate are due to 
exchanges of angular momentum with the Earth’s atmos¬ 
phere and core. The atmosphere is tightly coupled to the 
solid Earth. An increase in average global wind speed 
corresponds to an increase in the angular momentum of 
the atmosphere and corresponding decrease in angular 
momentum of the solid Earth. Accurate observations by 
very long baseline interferometry (see Section 2.4.6.6) 
confirm that rapid fluctuations in the length of the day 
are directly related to changes in the angular momentum 


of the atmosphere. On a longer timescale of decades, the 
changes in length of the day may be related to changes in 
the angular momentum of the core. The fluid in the outer 
core has a speed of the order of 0.1 mm s -1 relative to the 
overlying mantle. The mechanism for exchange of 
angular momentum between the fluid core and the rest of 
the Earth depends on the way the core and mantle are 
coupled. The coupling may be mechanical if topographic 
irregularities obstruct the flow of the core fluid along the 
core-mantle interface. The core fluid is a good electrical 
conductor so, if the lower mantle also has an appreciable 
electrical conductivity, it is possible that the core and 
mantle are coupled electromagnetically. 

23.4.2 Increase of the Earth-Moon distance 

Further consequences of lunar tidal friction can be seen 
by applying the law of conservation of momentum to the 
Earth-Moon pair. Let the Earth’s mass be E, its rate of 
rotation be co and its moment of inertia about the rota¬ 
tion axis be C; let the corresponding parameters for the 
Moon be m, fi L , and C L , and let the Earth-Moon dis¬ 
tance be r L . Further, let the distance of the common 
center of revolution be d from the center of the Earth, as 
given by Eq. (2.27). The angular momentum of the 
system is given by 

Cco + Eil L d 2 + mil L {r L — d ) 2 + C L fl L = constant (2.34) 

The fourth term is the angular momentum of the 
Moon about its own axis. Tidal deceleration due to the 
Earth’s attraction has slowed down the Moon’s rotation 
until it equals its rate of revolution about the Earth. Both 
and C L are very small and the fourth term can be 
neglected. The second and third terms can be combined 
so that we get 

Cco + ~ constant (2-35) 

The gravitational attraction of the Earth on the Moon 
is equal to the centripetal acceleration of the Moon about 
the common center of revolution, thus 

^^2 = ^l*'l “ d) = %/l ^E + m) (2.36) 

from which 

fi L /- 2 L = dG(E + m)r h (2.37) 

Inserting this in Eq. (2.35) gives 

Cco + ——^ lu d Q r = constant (2.38) 

y(E + m) 

The first term in this equation decreases, because tidal 
friction reduces co. To conserve angular momentum the 
second term must increase. Thus, lunar tidal braking of the 
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Fig. 2.16 Variation of latitude 
due to superposition of the 
435 day Chandler wobble 
period and an annual 
seasonal component (after 
Carter, 1989). 



Earth’s rotation causes an increase in the Earth-Moon dis¬ 
tance, r L . At present this distance is increasing at about 
3.7 cm yr -1 . As a further consequence Eq. (2.37) shows 
that the Moon’s rate of revolution about the Earth (O l ) - 
and consequently also its synchronous rotation about its 
own axis - must decrease when r L increases. Thus, tidal 
friction slows down the rates of Earth rotation, lunar rota¬ 
tion, and lunar orbital revolution and increases the 
Earth-Moon distance. 

Eventually a situation will evolve in which the Earth’s 
rotation has slowed until it is synchronous with the 
Moon’s own rotation and its orbital revolution about the 
Earth. All three rotations will then be synchronous and 
equivalent to about 48 present Earth days. This will 
happen when the Moon’s distance from Earth is about 88 
times the Earth’s radius (r L = 88 R\ it is presently equal to 
about 607?). The Moon will then be stationary over the 
Earth, and Earth and Moon will constantly present the 
same face to each other. This configuration already exists 
between the planet Pluto and its satellite Charon. 

2.3A.3 The Chandler wobble 

The Earth’s rotation gives it the shape of a spheroid, or 
ellipsoid of revolution. This figure is symmetric with 
respect to the mean axis of rotation, about which the 
moment of inertia is greatest; this is also called the axis of 
figure (see Section 2.4). However, at any moment the 
instantaneous rotational axis is displaced by a few meters 
from the axis of figure. The orientation of the total 
angular momentum vector remains nearly constant but 


the axis of figure changes location with time and appears 
to meander around the rotation axis (Fig. 2.16). 

The theory of this motion was described by Leonhard 
Euler (1707-1783), a Swiss mathematician. He showed 
that the displaced rotational axis of a rigid spheroid would 
execute a circular motion about its mean position, now 
called the Euler nutation. Because it occurs in the absence 
of an external driving torque, it is also called the free nuta¬ 
tion. It is due to differences in the way mass is distributed 
about the axis of rotational symmetry and an axis at right 
angles to it in the equatorial plane. The mass distributions 
are represented by the moments of inertia about these 
axes. If C and A are the moments of inertia about the rota¬ 
tional axis and an axis in the equatorial plane, respectively, 
Euler’s theory shows that the period of free nutation is 
AI(C— A) days, or approximately 305 days. 

Astronomers were unsuccessful in detecting a polar 
motion with this period. In 1891 an American geodesist 
and astronomer, S. C. Chandler, reported that the polar 
motion of the Earth’s axis contained two important com¬ 
ponents. An annual component with amplitude about 
0.10 seconds of arc is due to the transfer of mass between 
atmosphere and hydrosphere accompanying the chang¬ 
ing of the seasons. A slightly larger component with 
amplitude 0.15 seconds of arc has a period of 435 days. 
This polar motion is now called the Chandler wobble. It 
corresponds to the Euler nutation in an elastic Earth. 
The increase in period from 305 days to 435 days is a 
consequence of the elastic yielding of the Earth. The 
superposition of the annual and Chandler frequencies 
results in a beat effect, in which the amplitude of the 
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(a) 


pole to 
ecliptic 




Fig. 2.17 (a) The precession and forced nutation (greatly exaggerated) 
of the rotation axis due to the lunar torque on the spinning Earth (after 
Strahler, 1963). (b) Torque and incremental angular momentum 
changes resulting in precession. 


latitude variation is modulated with a period of 6-7 years 
(Fig. 2.16). 


23.4.4 Precession and nutation of the rotation axis 

During its orbital motion around the Sun the Earth’s axis 
maintains an (almost) constant tilt of about 23.5° to the 
pole to the ecliptic. The line of intersection of the plane of 
the ecliptic with the equatorial plane is called the line of 
equinoxes. Two times a year, when this line points directly 
at the Sun, day and night have equal duration over the 
entire globe. 

In the theory of the tides the unequal lunar attractions 
on the near and far side tidal bulges cause a torque about 
the rotation axis, which has a braking effect on the Earth’s 
rotation. The attractions of the Moon (and Sun) on the 
equatorial bulge due to rotational flattening also produce 
torques on the spinning Earth. On the side of the Earth 
nearer to the Moon (or Sun) the gravitational attraction 
F 2 on the equatorial bulge is greater than the force F 1 on 
the distant side (Fig. 2.17a). Due to the tilt of the rotation 
axis to the ecliptic plane (23.5°), the forces are not 


collinear. A torque results, which acts about a line in the 
equatorial plane, normal to the Earth-Sun line and 
normal to the spin axis. The magnitude of the torque 
changes as the Earth orbits around the Sun. It is 
minimum (and zero) at the spring and autumn equinoxes 
and maximum at the summer and winter solstices. 

The response of a rotating system to an applied torque 
is to acquire an additional component of angular 
momentum parallel to the torque. In our example this will 
be perpendicular to the angular momentum (/?) of the 
spinning Earth. The torque has a component (r) parallel 
to the line of equinoxes (Fig. 2.17b) and a component 
normal to this line in the equatorial plane. The torque r 
causes an increment Ah in angular momentum and shifts 
the angular momentum vector to a new position. If this 
exercise is repeated incrementally, the rotation axis moves 
around the surface of a cone whose axis is the pole to the 
ecliptic (Fig. 2.17a). The geographic pole P moves around 
a circle in the opposite sense from the Earth’s spin. This 
motion is called retrograde precession. It is not a steady 
motion, but pulsates in sympathy with the driving torque. 
A change in orientation of the rotation axis affects the 
location of the line of equinoxes and causes the timing of 
the equinoxes to change slowly. The rate of change is only 
50.4 seconds of arc per year, but it has been recognized 
during centuries of observation. For example, the Earth’s 
rotation axis now points at Polaris in the constellation 
Ursa Minor, but in the time of the Egyptians around 3000 
BC the pole star was Alpha Draconis, the brightest star in 
the constellation Draco. Hipparchus is credited with dis¬ 
covering the precession of the equinoxes in 120 BC by 
comparing his own observations with those of earlier 
astronomers. 

The theory of the phenomenon is well understood. 
The Moon also exerts a torque on the spinning Earth and 
contributes to the precession of the rotation axis (and 
equinoxes). As in the theory of the tides, the small size of 
the Moon compared to the Sun is more than compen¬ 
sated by its nearness, so that the precessional contribution 
of the Moon is about double the effect of the Sun. The 
theory of precession shows that the period of 25,700 yr is 
proportional to the Earth’s dynamical ellipticitv, H (see 
Eq. (2.45)). This ratio (equal to 1/305.457) is an impor¬ 
tant indicator of the internal distribution of mass in the 
Earth. 

The component of the torque in the equatorial plane 
adds an additional motion to the axis, called nutation, 
because it causes the axis to nod up and down (Fig. 
2.17a). The solar torque causes a semi-annual nutation, 
the lunar torque a semi-monthly one. In fact the motion 
of the axis exhibits many forced nutations, so-called 
because they respond to external torques. All are tiny per¬ 
turbations on the precessional motion, the largest having 
an amplitude of only about 9 seconds of arc and a period 
of 18.6yr. This nutation results from the fact that the 
plane of the lunar orbit is inclined at 5.145° to the plane 
of the ecliptic and (like the motion of artificial Earth 
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satellites) precesses retrogradely. This causes the inclina¬ 
tion of the lunar orbit to the equatorial plane to vary 
between about 18.4° and 28.6°, modulating the torque 
and forcing a nutation with a period of 18.6 yr. 

It is important to note that the Euler nutation and 
Chandler wobble are polar motions about the rotation 
axis, but the precession and forced nutations are displace¬ 
ments of the rotation axis itself. 

2.34.5 Milankovitch climatic cycles 

Solar energy can be imagined as flowing equally from the 
Sun in all directions. At distance r it floods a sphere with 
surface area 4Tn- 2 . The amount of solar energy falling per 
second on a square meter (the insolation) therefore 
decreases as the inverse square of the distance from the 
Sun. The gravitational attractions of the Moon, Sun, and 
the other planets - especially Jupiter - cause cyclical 
changes of the orientation of the rotation axis and varia¬ 
tions in the shape and orientation of Earth’s orbit. These 
variations modify the insolation of the Earth and result in 
long-term periodic changes in Earth’s climate. 

The angle between the rotational axis and the pole to 
the ecliptic is called the obliquity. It is the main factor 
determining the seasonal difference between summer and 
winter in each hemisphere. In the northern hemisphere, the 
insolation is maximum at the summer solstice (currently 
June 21) and minimum at the winter solstice (December 
21-22). The exact dates change with the precession of the 
equinoxes, and also depend on the occurrence of leap 
years. The solstices do not coincide with extreme positions 
in Earth’s orbit. The Earth currently reaches aphelion, its 
furthest distance from the Sun, around July 4-6, shortly 
after the summer solstice, and passes perihelion around 
January 2-4. About 13,000 yr from now, as a result of pre¬ 
cession, the summer solstice will occur when Earth is close 
to perihelion. In this way, precession causes long-term 
changes in climate with a period related to the precession. 

The gravitational attraction of the other planets causes 
the obliquity to change cyclically with time. It is currently 
equal to 23° 26' 21.4" but varies slowly between a minimum 
of 21° 55' and a maximum of 24° 18'. When the obliquity 
increases, the seasonal differences in temperature become 
more pronounced, while the opposite effect ensues if obliq¬ 
uity decreases. Thus, the variation in obliquity causes a 
modulation in the seasonal contrast between summer and 
winter on a global scale. This effect is manifest as a cyclical 
change in climate with a period of about 41 kyr. 

A further effect of planetary attraction is to cause the 
eccentricity of the Earth’s orbit, at present 0.017, to 
change cyclically (Fig. 2.18). At one extreme of the cycle, 
the orbit is almost circular, with an eccentricity of only 
0.005. The closest distance from the Sun at perihelion is 
then 99% of the furthest distance at aphelion. At the 
other extreme, the orbit is more elongate, although with 
an eccentricity of 0.058 it is only slightly elliptical. The 
perihelion distance is then 89% of the aphelion distance. 



Fig. 2.18 Schematic illustration of the 100,000 yr variations in 
eccentricity and rotation of the axis of the Earth's elliptical orbit. The 
effects are greatly exaggerated for ease of visualization. 

These slight differences have climatic effects. When the 
orbit is almost circular, the difference in insolation 
between summer and winter is negligible. However, when 
the orbit is most elongate, the insolation in winter is only 
78% of the summer insolation. The cyclical variation in 
eccentricity has a dominant period of 404 kyr and lesser 
periodicities of 95kyr, 99kyr, 124kyr and 131 kyr that 
together give a roughly 100 kyr period. The eccentricity 
variations generate fluctuations in paleoclimatic records 
with periods around 100 kyr and 400 kyr. 

Not only does planetary attraction cause the shape of 
the orbit to change, it also causes the perihelion-aphelion 
axis of the orbit to precess. The orbital ellipse is not truly 
closed, and the path of the Earth describes a rosette with a 
period that is also around 100 kyr (Fig. 2.18). The preces¬ 
sion of perihelion interacts with the axial precession and 
modifies the observed period of the equinoxes. The 26 kyr 
axial precession is retrograde with a rate of 0.038 
cycles/kyr; the 100 kyr orbital precession is prograde, which 
speeds up the effective precession rate to 0.048 cycles/kyr. 
This is equivalent to a retrograde precession with a period 
of about 21 kyr. A corresponding climatic fluctuation has 
been interpreted in many sedimentary deposits. 

Climatic effects related to cyclical changes in the Earth’s 
rotational and orbital parameters were first studied 
between 1920 and 1938 by a Yugoslavian astronomer, 
Milutin Milankovic (anglicized to Milankovitch). Period¬ 
icities of 21 kyr, 41 kyr, 100 kyr and 400 kyr - called the 
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Milankovitch climatic cycles - have been described in 
various sedimentary records ranging in age from Quater¬ 
nary to Mesozoic. Caution must be used in interpreting 
the cyclicities in older records, as the characteristic 
Milankovitch periods are dependent on astronomical 
parameters that may have changed appreciably during the 
geological ages. 

2.3.5 Coriolis and Eotvos accelerations 

Every object on the Earth experiences the centrifugal 
acceleration due to the Earth’s rotation. Moving objects 
on the rotating Earth experience additional accelerations 
related to the velocity at which they are moving. At 
latitude A the distance d of a point on the Earth's surface 
from the rotational axis is equal to RcosA, and the rota¬ 
tional spin w translates to an eastwards linear velocity v 
equal to wRcosA. Consider an object (e.g., a vehicle or 
projectile) that is moving at velocity v across the Earth’s 
surface. In general v has a northward component v N and 
an eastward component v E . Consider first the effects 
related to the eastward velocity, which is added to the 
linear velocity of the rotation. The centrifugal accelera¬ 
tion increases by an amount A a c , which can be obtained 
by differentiating a c in Eq. (2.19) with respect to w 

Aa c = 2w(i?cosA)Aw = 2wv e (2.39) 

The extra centrifugal acceleration A a c can be resolved 
into a vertical component and a horizontal component 
(Fig. 2.19a). The vertical component, equal to 2«v E cosA, 
acts upward, opposite to gravity. It is called the Eotvos 
acceleration. Its effect is to decrease the measured gravity 
by a small amount. If the moving object has a westward 
component of velocity the Eotvos acceleration increases 
the measured gravity. If gravity measurements are made 
on a moving platform (for example, on a research ship or 
in an airplane), the measured gravity must be corrected to 
allow for the Eotvos effect. For a ship sailing eastward at 
10 km h -1 at latitude 45° the Eotvos correction is 
28.6mgal; in an airplane flying eastward at 300km h -1 
the correction is 856 mgal. These corrections are far 
greater than the sizes of many important gravity anom¬ 
alies. However, the Eotvos correction can be made satis¬ 
factorily in marine gravity surveys, and recent technical 
advances now make it feasible in aerogravimetry. 

The horizontal component of the extra centrifugal 
acceleration due to v E is equal to 2cov E sinA. In the north¬ 
ern hemisphere it acts to the south. If the object moves 
westward, the acceleration is northward. In each case it 
acts horizontally to the right of the direction of motion. 
In the southern hemisphere the sense of this acceleration 
is reversed; it acts to the left of the direction of motion. 
This acceleration is a component of the Coriolis accelera¬ 
tion , another component of which derives from the north¬ 
ward motion of the object. 

Consider an object moving northward along a meridian 
of longitude (Fig. 2.19b, point 1). The linear velocity of a 




Fig. 2.19 (a) Resolution of the additional centrifugal acceleration Aa c 
due to eastward velocity into vertical and horizontal components, (b) 

The horizontal deviations of the northward or southward trajectory of 
an object due to conservation of its angular momentum. 

point on the Earth’s surface decreases poleward, because 
the distance from the axis of rotation (d=Rcos\) 
decreases. The angular momentum of the moving object 
must be conserved, so the eastward velocity v E must 
increase. As the object moves to the north its eastward 
velocity is faster than the circles of latitude it crosses and 
its trajectory deviates to the right. If the motion is to the 
south (Fig. 2.19b, point 2), the inverse argument applies. 
The body crosses circles of latitude with faster eastward 
velocity than its own and, in order to maintain angular 
momentum, its trajectory must deviate to the west. In each 
case the deviation is to the right of the direction of motion. 
A similar argument applied to the southern hemisphere 
gives a Coriolis effect to the left of the direction of motion 
(Fig. 2.19b, points 3 and 4). 

The magnitude of the Coriolis acceleration is easily 
evaluated quantitatively. The angular momentum /; of a 
mass m at latitude A is equal to mcoR 2 cos 2 \. Conservation 
of angular momentum gives 

|iy = mR 2 cos 2 A^ + ma]R 2 (— 2cosAsinA)|jy = 0 (2.40) 

Rearranging and simplifying, we get 




2.4 THE EARTH'S FIGURE AND GRAVITY 


61 


CRcosA)^ = 2<usinACR^) (2.41) 

The expression on the left of the equation is an acceler¬ 
ation, a E , equal to the rate of change of the eastward 
velocity. The expression in brackets on the right is the 
northward velocity component v N . We can write this 
component of the Coriolis acceleration as 2<wv N sinA. The 
north and east components of the Coriolis acceleration 
are therefore: 

a N = 2wv F sinA 

N E . (2.42) 

a E = 2wr N sinA 

The Coriolis acceleration deflects the horizontal path 
of any object moving on the Earth’s surface. It affects the 
directions of wind and ocean currents, eventually con¬ 
straining them to form circulatory patterns about centers 
of high or low pressure, and thereby plays an important 
role in determining the weather. 

2.4 THE EARTH'S FIGURE AND GRAVITY 

2.4.1 The figure of the Earth 

The true surface of the Earth is uneven and irregular, 
partly land and partly water. For geophysical purposes 
the Earth’s shape is represented by a smooth closed 
surface, which is called the figure of the Earth. Early con¬ 
cepts of the figure were governed by religion, superstition 
and non-scientific beliefs. The first circumnavigation of 
the Earth, completed in 1522 by Magellan’s crew, estab¬ 
lished that the Earth was probably round. Before the era 
of scientific awakening the Earth’s shape was believed to 
be a sphere. As confirmed by numerous photographs 
from spacecraft, this is in fact an excellent first approxi¬ 
mation to Earth’s shape that is adequate for solving many 
problems. The original suggestion that the Earth is a 
spheroid flattened at the poles is credited to Newton, who 
used a hydrostatic argument to account for the polar flat¬ 
tening. The slightly flattened shape permitted an explana¬ 
tion of why a clock that was precise in Paris lost time near 
to the equator (see Section 2.1). 

Earth’s shape and gravity are intimately associated. 
The figure of the Earth is the shape of an equipotential 
surface of gravity, in particular the one that coincides 
with mean sea level. The best mathematical approxima¬ 
tion to the figure is an oblate ellipsoid, or spheroid (Fig. 
2.20). The precise determination of the dimensions of the 
Earth (e.g., its polar and equatorial radii) is the main 
objective of the science of geodesy. It requires an exact 
knowledge of the Earth’s gravity field, the description of 
which is the goal of gravimetry. 

Modern analyses of the Earth’s shape are based on 
precise observations of the orbits of artificial Earth satel¬ 
lites. These data are used to define a best-fitting oblate 
ellipsoid, called the International Reference Ellipsoid. In 
1930 geodesists and geophysicists defined an optimum 



Fig. 2.20 Comparison of the dimensions of the International Reference 
Ellipsoid with a sphere of equal volume. 

reference ellipsoid based on the best available data at the 
time. The dimensions of this figure have been subse¬ 
quently refined as more exact data have become available. 
In 1980 the International Association of Geodesy adopted 
a Geodetic Reference System (GRS80) in which the refer¬ 
ence ellipsoid has an equatorial radius (a) equal to 
6378.137 km and a polar radius (c) equal to 6356.752 km. 
Subsequent determinations have resulted in only minor 
differences in the most important geodetic parameters. 
Some current values are listed in Table 2.1. The radius of 
the equivalent sphere (R) is found from R = (a 2 c) m to be 
6371.000 km. Compared to the best-fitting sphere the 
spheroid is flattened by about 14.2 km at each pole and 
the equator bulges by about 7.1 km. The polar flattening / 
is defined as the ratio 

/=T (2.43) 

The flattening of the optimum reference ellipsoid defined 
in 1930 was exactly 1/297. This ellipsoid, and the 
variation of gravity on its surface, served as the basis of 
gravimetric surveying for many years, until the era of 
satellite geodesy and highly sensitive gravimeters showed 
it to be too inexact. A recent best estimate of the flatten¬ 
ing is / = 3.352 87 X 10“ 3 (i.e., /= 1/298.252). 

If the Earth is assumed to be a rotating fluid in perfect 
hydrostatic equilibrium (as assumed by Newton’s theory), 
the flattening should be 1/299.5, slightly smaller than the 
observed value. The hydrostatic condition assumes that 
the Earth has no internal strength. A possible explanation 
for the tiny discrepancy in / is that the Earth has sufficient 
strength to maintain a non-hydrostatic figure, and the 
present figure is inherited from a time of more rapid rota¬ 
tion. Alternatively, the slightly more flattened form of the 
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Table 2.1 Some fundamental parameters relevant to the shape, rotation and orbit of the Earth. Sources: [1 ] Mohr and 
Taylor, 2005; [2] McCarthy and Petit, 2004: [3] Groten, 2004 


Parameter 

Symbol 

Value 

Units 

Reference 

Terrestrial parameters (2004) 


6.673 Xl(T n 



Gravitational constant 

G 

m 3 kg~* s~ 2 

[1] 

Geocentric gravitational constant 

GE 

3.9860044 X 10 14 

m 3 s~ 2 

[2] 

Mass of the Earth: E=(GE)IG 

E 

5.9737 X 10 24 

kg 


Earth’s equatorial radius 

a 

6 378.137 

km 

[3] 

Earth’s polar radius: c = a{ 1 -J) 

c 

6 356.752 

km 


Radius of equivalent sphere: R 0 = ( a 2 c) m 

R o 

6 371.000 

km 


Mean equatorial gravity 

g e 

9.7803278 

m s- 2 

[2] 

Mean angular velocity of rotation 

n 

7.292115 X 10~ 5 

rad s _1 

[2] 

Dynamical form-factor 

J 2 

1.0826359 X 10~ 3 


[3] 

Flattening 

f 

1 :298.252 


[3] 

Equatorial acceleration ratio 

m 

1 :288.901 


[3] 

Dynamical ellipticity 

H 

1 :305.457 


[3] 

Orbital parameters (2003) 





Astronomical unit 

AU 

149,597,870.691 

km 

[2] 

Solar mass ratio 

Ps 

332,946.0 


[2] 

Lunar mass ratio 

P L 

0.012300038 


[2] 

Obliquity of the ecliptic 

£ o 

23° 26' 21.4" 


[2] 

Obliquity of lunar orbit to ecliptic 

5° 0.9' 


[3] 

Eccentricity of solar orbit of barycenter 


0.01671 


[3] 

Eccentricity of lunar orbit 


0.05490 


[3] 


Earth may be due to internal density contrasts, which 
could be the consequence of slow convection in the 
Earth’s mantle. This would take place over long time 
intervals and could result in a non-hydrostatic mass dis¬ 
tribution. 

The cause of the polar flattening is the deforming 
effect of the centrifugal acceleration. This is maximum 
at the equator where the gravitational acceleration is 
smallest. The parameter m is defined as the ratio of the 
equatorial centrifugal acceleration to the equatorial grav¬ 
itational acceleration: 


ufa _ ufcP 
GE/a 2 GE 


(2.44) 


The value of m based on current geodetic values (Table 
2.1) is 3.461 39 X 10 -3 (i.e., m = 1/288.901). 

As a result of the flattening, the distribution of mass 
within the Earth is not simply dependent on radius. The 
moments of inertia of the Earth about the rotation axis 
(C) and any axis in the equatorial plane (A) are unequal. 
As noted in the previous section the inequality affects the 
way the Earth responds to external gravitational torques 
and is a determining factor in perturbations of the 
Earth’s rotation. The principal moments of inertia define 
the dynamical ellipticity : 


H= 


C-\(A + B) c-A 

c ~ c 


(2.45) 


The dynamical ellipticity is obtained from precise obser¬ 
vations of the orbits of artificial satellites of the Earth 
(see Section 2.4.5.1). The current optimum value for His 
3.273 787 5 X 10“ 3 (i.e., H= 1/305.457). 


z 



Fig. 2.21 Parameters of the ellipsoid used in MacCullagh's formula. A, 
B, and C are moments of inertia about the x-,y- and z-axes, respectively, 
and / is the moment of inertia about the line OP. 


2.4.2 Gravitational potential of the spheroidal Earth 

The ellipsoidal shape changes the gravitational potential 
of the Earth from that of an undeformed sphere. In 
1849 J. MacCullagh developed the following formula for 
the gravitational potential of any body at large distance 
from its center of mass: 



(A + B+C-M) 
2 r 3 


(2.46) 
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The first term, of order r~ l , is the gravitational potential of 
a point mass or sphere with mass E (Eqs. (2.10) and (2.14)); 
for the Earth it describes the potential of the undeformed 
globe. If the reference axes are centered on the body’s 
center of mass, there is no term in r~ 2 . The second term, of 
order r -3 , is due to deviations from the spherical shape. For 
the flattened Earth it results from the mass displacements 
due to the rotational deformation. The parameters A, B, 
and C are the principal moments of inertia of the body and 
I is the moment of inertia about the line OP joining the 
center of mass to the point of observation (Fig. 2.21). In 
order to express the potential accurately an infinite number 
of terms of higher order in r are needed. In the case of the 
Earth these can be neglected, because the next term is 
about 1000 times smaller than the second term. 

For a body with planes of symmetry, I is a simple com¬ 
bination of the principal moments of inertia. Setting A 
equal to B for rotational symmetry, and defining the angle 
between OP and the rotation axis to be 6, the expression 
for /is 

I = A sin 2 0 + C cos 2 0 (2.47) 

MacCullagh’s formula for the ellipsoidal Earth then 
becomes 


16.5 m 



Fig. 2.22 The third-order term in the gravitational potential 
describes a pear-shaped Earth. The deviations from the reference 
ellipsoid are of the order of 10-20 m, much smaller than the 
deviations of the ellipsoid from a sphere, which are of the order 
of 10-20 km. 


dynamical form-factor J 2 , which describes the effect of 
the polar flattening on the Earth’s gravitational poten¬ 
tial. Comparison of terms in Eqs. (2.48) and (2.51) gives 
the result 

/ 2 = ^# (2.52) 


U 0 =~G 


E 

r 


-G 


(C-A)( 3cos 2 0- 1) 
r 3 2 


(2.48) 


The function (3cos 2 d - l )/2 is a second-order polyno¬ 
mial in cost), written as P 2 ( cosd). It belongs to a family of 
functions called Legendre polynomials (Box 2.2). Using 
this notation MacCullagh’s formula for the gravitational 
potential of the oblate ellipsoid becomes 

E ( C-A ) 

U G = -Gf-G^—^PA cosd) (2.49) 

This can be written in the alternative form 


U G = - G f(i-(^t)(7)^2(cos0)) (2.50) 

Potential theory requires that the gravitational poten¬ 
tial of the spheroidal Earth must satisfy an important 
equation, the Laplace equation (Box 2.3). The solution of 
this equation is the sum of an infinite number of terms of 
increasing order in Mr, each involving an appropriate 
Legendre polynomial: 


U G = -Gf(l - i(f)V„(cos0)) (2.51) 

In this equation the coefficients J n multiplying 
P n (cosd) determine the relative importance of the term 
of «th order. The values of J n are obtained from satellite 
geodesy: J 2 = 1082.6 X 10 -6 ; / 3 = — 2.54X 10 -6 ; 

/ 4 =-1.59X 10 -6 ; higher orders are insignificant. The 
most important coefficient is the second order one, the 


The term of next higher order (« = 3) in Eq. (2.51) 
describes the deviations from the reference ellipsoid 
which correspond to a pear-shaped Earth (Fig. 2.22). 
These deviations are of the order of 7-17 m, a thousand 
times smaller than the deviations of the ellipsoid from a 
sphere, which are of the order of 7-14 km. 

2.4.3 Gravity and its potential 

The potential of gravity ( U ) is the sum of the gravita¬ 
tional and centrifugal potentials. It is often called the 
geopotential. At a point on the surface of the rotating 
spheroid it can be written 

U g =U G ~ ju 2 r 2 sin 2 0 (2.53) 

If the free surface is an equipotential surface of 
gravity, then U g is everywhere constant on it. The shape of 
the equipotential surface is constrained to be that of the 
spheroid with flattening /. Under these conditions a 
simple relation is found between the constants /, m 
and J 2 \ 

J 2 = \(lf-m) (2.54) 

By equating Eqs. (2.52) and (2.54) and re-ordering 

terms slightly we obtain the following relationship 

^# = i(2 f-rri) (2.55) 

This yields useful information about the variation of 
density within the Earth. The quantities /, m and 
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Box 2.2: Legendre polynomials 


In the triangle depicted in Fig. B2.2 the side it is related 
to the other two sides r and R and the angle 9 they 
enclose by the cosine law. The expression for 1/m can 
then be written: 


1 = _1_ 

11 (R 2 + r 2 — 2rRcos0) 1/2 


1 + | -^-2-^cosfl 


- 1/2 


(1) 


which on expanding becomes 


1 = 1 
U R 


, , r n , r 2 ( 3 cos 2 6 — 1 ^ 

1+ ^ cosfl + M-2-j 


r 3 /5cos 3 0-3cos0) 
R 3 \ 2 


( 2 ) 


This infinitely long series of terms in (r/R) is called the 
reciprocal distance formula. It can be written in short¬ 
hand form as 


1 

u 


1 

R 


t 

n—1 


R 


PfcosO) 


( 3 ) 



Fig. B2.2 Reference triangle for derivation of Legendre 
polynomials. 

This, named in his honor, is the Legendre equation. It 
plays an important role in geophysical potential theory 
for situations expressed in spherical coordinates that 
have rotational symmetry about an axis. This is, for 
example, the case for the gravitational attraction of a 
spheroid, the simplified form of the Earth’s shape. 

The derivation of an individual polynomial of order 
n is rather tedious if the expanded expression for (1/m) is 
used. A simple formula for calculating the Legendre 
polynomials for any order n was developed by another 
French mathematician, Olinde Rodrigues (1794-1851). 
The Rodrigues formula is 


The angle 6 in this expression describes the angular 
deviation between the side r and the reference side R. The 
functions P n (cos 6) in the sum are called the ordinary 
Legendre polynomials of order n in cos 6. They are named 
after a French mathematician Adrien Marie Legendre 
(1752-1833). Each polynomial is a coefficient of ( r/R) n in 
the infinite sum of terms for (1/m), and so has order n. 
Writing cos 6 = x, and P n ( cos 9) = P„(x), the first few poly¬ 
nomials, for n = 0,1,2, and 3, respectively, are as follows 

W = 1 P 1 (x) = \(3x 2 -\) 

: (4) 

Pfx) = x Pfx) — ^(5x 3 — 3x) 

By substituting cos 9 for x these expressions can be 
converted into functions of cost/. Legendre discovered 
that the polynomials satisfied the following second- 
order differential equation, in which n is an integer and 
y = P(x): 

^ (1_x2 )S + w(w+1) 3 ; = 0 ( 5 ) 


( 6 ) 

A relative of this equation encountered in many prob¬ 
lems of potential theory is the associated Legendre equa¬ 
tion , which written as a function of x is 

ic (1 - * 2 ) § + ( M ( w +1 ) - =0 (7) 

The solutions of this equation involve two integers, 
the order n and degree m. As in the case of the ordinary 
Legendre equation the solutions are polynomials in x, 
which are called the associated Legendre polynomials 
and written P'”(x). A modification of the Rodrigues 
formula allows easy computation of these functions 
from the ordinary Legendre polynomials: 

C(x) = (l -x 2 r /2 .^>„(x) (8) 

To express the associated Legendre polynomials as 
functions of 9, i.e. as P" 1 '(cos9), it is again only neces¬ 
sary to substitute cos 9 for x. 


( C—A)/C are each equal to approximately 1/300. 
Inserting their values in the equation gives C«0.33FR 2 . 
Compare this value with the principal moments of inertia 
of a hollow spherical shell (0.66 ER 2 ) and a solid sphere 
with uniform density (0.4 ER 2 ). The concentration of 
mass near the center causes a reduction in the multiplying 
factor from 0.66 to 0.4. The value of 0.33 for the Earth 


implies that, in comparison with a uniform solid sphere, 
the density must increase towards the center of the Earth. 

2.4.4 Normal gravity 

The direction of gravity at a point is defined as perpendic¬ 
ular to the equipotential surface through the point. This 
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Box 2.3: Spherical harmonics 


Many natural forces are directed towards a central 
point. Examples are the electrical field of a point 
charge, the magnetic field of a single magnetic pole, and 
the gravitational acceleration toward a center of mass. 
The French astronomer and mathematician Pierre 
Simon, marquis de Laplace (1749-1827) showed that, 
in order to fulfil this basic physical condition, the 
potential of the field must satisfy a second-order 
differential equation, the Laplace equation. This is one 
of the most famous and important equations in physics 
and geophysics, as it applies to many situations in 
potential theory. For the gravitational potential U c the 
Laplace equation is written in Cartesian coordinates 
(x, y, z) as 


d 2 U G d 2 U G 
dx 2 dy 2 


d 2 U C 

dr 2 


; = 0 


( 1 ) 


the point of observation from the reference axis (see 
Box 2.1). In geographic coordinates 9 is the co-latitude. 

If the potential field is not rotationally symmetric - 
as is the case, for example, for the geoid and the Earth’s 
magnetic field - the solution of the Laplace equation 
varies with azimuth f as well as with radius r and axial 
angle 9 and is given by 

U G = ii ( A n r " + )i(aS'cosm<f> 

n— 0 ' fm =0 

+ b™ sin mf)P™ (cos 9) (4) 

where in this case P™(cosd) is an associated Legendre 
polynomial of order n and degree m as described in Box 
2.2. This equation can in turn be written in modified 
form as 


In spherical polar coordinates (r, 9 , <j>) the Laplace 
equation becomes 


B„ 




(5) 


1 d ^ 5 Ug 

r 2 dr dr 


1 d ■ , 1 d-U G 

: — SU10 - h 7j . -pr- 

r 2 sind dd dQ r 2 sin-0 dc> 2 


0 ( 2 ) 


The variation with azimuth </> disappears for symmetry 
about the rotational axis. The general solution of the 
Laplace equation for rotational symmetry (e.g., for a 
spheroidal Earth) is 


U G=1k[ A n rn + -^i)p„(cosd) (3) 

where P n (cos9) is an ordinary Legendre polynomial of 
order n and the coordinate 6 is the angular deviation of 


The function 

T”'(0,/) = (a™ cos m<l> + h™sin;«(/>) P'"(cos6) (6) 

is called a spherical harmonic function, because it has the 
same value when 6 or / is increased by an integral multi¬ 
ple of 2 tt. It describes the variation of the potential with 
the coordinates 9 and / on a spherical surface (i.e., for 
which r is a constant). Spherical harmonic functions are 
used, for example, for describing the variations of the 
gravitational and magnetic potentials, geoid height, and 
global heat flow with latitude and longitude on the 
surface of the Earth. 


defines the vertical at the point, while the plane tangential 
to the equipotential surface defines the horizontal (Fig. 
2.20). A consequence of the spheroidal shape of the 
Earth is that the vertical direction is generally not radial, 
except on the equator and at the poles. 

On a spherical Earth there is no ambiguity in how we 
define latitude. It is the angle at the center of the Earth 
between the radius and the equator, the complement to 
the polar angle 9. This defines the geocentric latitude A'. 
However, the geographic latitude in common use is not 
defined in this way. It is found by geodetic measurement 
of the angle of elevation of a fixed star above the 
horizon. But the horizontal plane is tangential to the 
ellipsoid, not to a sphere (Fig. 2.20), and the vertical 
direction (i.e., the local direction of gravity) intersects 
the equator at an angle A that is slightly larger than the 
geocentric latitude A' (Fig. 2.23). The difference (A - A') 
is zero at the equator and poles and reaches a maximum 


at a latitude of 45°, where it amounts to only 0.19° 
(about 12'). 

The International Reference Ellipsoid is the standard¬ 
ized reference figure of the Earth. The theoretical value of 
gravity on the rotating ellipsoid can be computed by 
differentiating the gravity potential (Eq. (2.53)). This 
yields the radial and transverse components of gravity, 
which are then combined to give the following formula 
for gravity normal to the ellipsoid: 

g n = g e ( 1 + PfirfX + p 2 sin 2 2A) (2.56) 

Where, to second order in/and m, 

gn = S e ( 1 +/“ \ m +/ 2 “ 1 

P\ = \ m “/+ “ fyf m (2 ' 57) 

Pi = g / 2 - \f m 
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gravity = g = a G + a c 

Fig. 2.23 Gravity on the ellipsoidal Earth is the vector sum of the 
gravitational and centrifugal accelerations and is not radial; 
consequently, geographic latitude (A) is slightly larger than geocentric 
latitude (AO- 



local 

(b) gravity 



Fig. 2.24 (a) A mass outside the ellipsoid or (b) a mass excess below the 
ellipsoid elevates the geoid above the ellipsoid. N is the geoid 
undulation. 


Equation (2.56) is known as the normal gravity 
formula. The constants in the formula, defined in 1980 
for the Geodetic Reference System (GRS80) still in com¬ 
mon use, are: g e = 9.780 327m s -2 ; f5 x = 5.30244 X 10 -3 ; 
/3 2 = —5.8X10 -6 . They allow calculation of normal 
gravity at any latitude with an accuracy of O.lmgal. 
Modern instruments can measure gravity differences 
with even greater precision, in which case a more exact 
formula, accurate to 0.0001 mgal, can be used. The 
normal gravity formula is very important in the analysis 
of gravity measurements on the Earth, because it gives 
the theoretical variation of normal gravity (g n ) with 
latitude on the surface of the reference ellipsoid. 

The normal gravity is expressed in terms of g e , the 
value of gravity on the equator. The second-order terms 
/ 2 , m 2 and fin are about 300 times smaller than the first- 
order terms / and m. The constant /3 2 is about 1000 
times smaller than /3 r If we drop second-order terms 
and use A = 90°, the value of normal gravity at the pole is 
gp — g e (1 -T /3j), so by rearranging and retaining only 
first-order terms, we get 

gp ~ Se 5 

g e =2 m ~f (2.58) 

This expression is called Clairaut’s theorem. It was devel¬ 
oped in 1743 by a French mathematician, Alexis-Claude 
Clairaut, who was the first to relate the variation of gravity 
on the rotating Earth with the flattening of the spheroid. 
The normal gravity formula gives g p = 9.832 186m s -2 . 
Numerically, this gives an increase in gravity from 
equator to pole of approximately 5.186 X 10 -2 m s -2 , or 
5186 mgal. 


There are two obvious reasons for the poleward 
increase in gravity. The distance to the center of mass of 
the Earth is shorter at the poles than at the equator. This 
gives a stronger gravitational acceleration (a C} ) at the 
poles. The difference is 

Mf-f) <2 - 591 

This gives an excess gravity of approximately 
6600 mgal at the poles. The effect of the centrifugal force in 
diminishing gravity is largest at the equator, where it 
equals ( ma Q ), and is zero at the poles. This also results in a 
poleward increase of gravity, amounting to about 3375 
mgal. These figures indicate that gravity should increase by 
a total of 9975 mgal from equator to pole, instead of the 
observed difference of 5186 mgal. The discrepancy can be 
resolved by taking into account a third factor. The compu¬ 
tation of the difference in gravitational attraction is not so 
simple as indicated by Eq. (2.59). The equatorial bulge 
places an excess of mass under the equator, increasing the 
equatorial gravitational attraction and thereby reducing 
the gravity decrease from equator to pole. 

2.4.5 The geoid 

The international reference ellipsoid is a close approxi¬ 
mation to the equipotential surface of gravity, but it is 
really a mathematical convenience. The physical equipo¬ 
tential surface of gravity is called the geoid. It reflects the 
true distribution of mass inside the Earth and differs 
from the theoretical ellipsoid by small amounts. Far from 
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Fig. 2.25 World map of 
geoid undulations relative to a 
reference ellipsoid of 
flattening/ 1/298.257 (after 
Lerch eta!., 1979). 



180°W 


land the geoid agrees with the free ocean surface, exclud¬ 
ing the temporary perturbing effects of tides and winds. 
Over the continents the geoid is affected by the mass of 
land above mean sea level (Fig. 2.24a). The mass within 
the ellipsoid causes a downward gravitational attraction 
toward the center of the Earth, but a hill or mountain 
whose center of gravity is outside the ellipsoid causes an 
upward attraction. This causes a local elevation of the 
geoid above the ellipsoid. The displacement between the 
geoid and the ellipsoid is called a geoid undulation ; the 
elevation caused by the mass above the ellipsoid is a posi¬ 
tive undulation. 

24.5.1 Geoid undulations 

In computing the theoretical figure of the Earth the dis¬ 
tribution of mass beneath the ellipsoid is assumed to be 
homogeneous. A local excess of mass under the ellipsoid 
will deflect and strengthen gravity locally. The potential 
of the ellipsoid is achieved further from the center of the 
Earth. The equipotential surface is forced to warp 
upward while remaining normal to gravity. This gives a 
positive geoid undulation over a mass excess under the 
ellipsoid (Fig. 2.24b). Conversely, a mass deficit beneath 
the ellipsoid will deflect the geoid below the ellipsoid, 
causing a negative geoid undulation. As a result of the 
uneven topography and heterogeneous internal mass 
distribution of the Earth, the geoid is a bumpy equipoten¬ 
tial surface. 

The potential of the geoid is represented mathemati¬ 
cally by spherical harmonic functions that involve the asso¬ 
ciated Legendre polynomials (Box 2.3). These are more 
complicated than the ordinary Legendre polynomials 
used to describe the gravitational potential of the ellipsoid 
(Eqs. (2.49)—(2.51)). Until now we have only considered 


variation of the potential with distance r and with the co¬ 
latitude angle 6. This is an oversimplification, because 
density variations within the Earth are not symmetrical 
about the rotation axis. The geoid is an equipotential 
surface for the real density distribution in the Earth, and 
so the potential of the geoid varies with longitude as well 
as co-latitude. These variations are taken into account by 
expressing the potential as a sum of spherical harmonic 
functions, as described in Box 2.3. This representation of 
the geopotential is analogous to the simpler expression for 
the gravitational potential of the rotationally symmetric 
Earth using a series of Legendre polynomials (Eq. (2.51)). 

In modern analyses the coefficient of each term in 
the geopotential - similar to the coefficients J n in Eq. (2.51) 
- can be calculated up to a high harmonic degree. The 
terms up to a selected degree are then used to compute a 
model of the geoid and the Earth's gravity field. A combi¬ 
nation of satellite data and surface gravity measurements 
was used to construct Goddard Earth Model (GEM) 10. A 
global comparison between a reference ellipsoid with flat¬ 
tening 1/298.257 and the geoid surface computed from the 
GEM 10 model shows long-wavelength geoid undulations 
(Fig. 2.25). The largest negative undulation (-105 m) is in 
the Indian Ocean south of India, and the largest positive 
undulation (+73 m) is in the equatorial Pacific Ocean 
north of Australia. These large-scale features are too 
broad to be ascribed to shallow crustal or lithospheric 
mass anomalies. They are thought to be due to hetero¬ 
geneities that extend deep into the lower mantle, but their 
origin is not yet understood. 

2.4.6 Satellite geodesy 

Since the early 1960s knowledge of the geoid has been 
dramatically enhanced by the science of satellite geodesy. 
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Fig. 2.26 The retrograde precession of a satellite orbit causes the line of 
nodes (CN,, CN 2 ) to change position on successive equatorial crossings. 


The motions of artificial satellites in Earth orbits are 
influenced by the Earth’s mass distribution. The most 
important interaction is the simple balance between the 
centrifugal force and the gravitational attraction of the 
Earth’s mass, which determines the radius of the satel¬ 
lite’s orbit. Analysis of the precession of the Earth’s rota¬ 
tion axis (Section 2.3.4.4) shows that it is determined by 
the dynamical ellipticity H , which depends on the 
difference between the principal moments of inertia 
resulting from the rotational flattening. In principle, the 
gravitational attraction of an artificial satellite on the 
Earth’s equatorial bulge also contributes to the preces¬ 
sion, but the effect is too tiny to be measurable. However, 
the inverse attraction of the equatorial bulge on the satel¬ 
lite causes the orbit of the satellite to precess around the 
rotation axis. The plane of the orbit intersects the equato¬ 
rial plane in the line of nodes. Let this be represented by 
the line CNj in Fig. 2.26. On the next passage of the satel¬ 
lite around the Earth the precession of the orbit has 
moved the nodal line to a new position CN 2 . The orbital 
precession in this case is retrograde', the nodal line 
regresses. For a satellite orbiting in the same sense as the 
Earth’s rotation the longitude of the nodal line shifts 
gradually westward; if the orbital sense is opposite to the 
Earth’s rotation the longitude of the nodal line shifts 
gradually eastward. Because of the precession of its orbit 
the path of a satellite eventually covers the entire Earth 
between the north and south circles of latitude defined by 
the inclination of the orbit. The profusion of high-quality 
satellite data is the best source for calculating the dynami¬ 
cal ellipticity or the related parameter J 2 in the gravity 
potential. Observations of satellite orbits are so precise 
that small perturbations of the orbit can be related to the 
gravitational field and to the geoid. 




Fig. 2.27 Changes in the arc distance between satellite laser-ranging 
(SLR) stations in Australia and Hawaii determined from LAGEOS 
observations over a period of four years. The mean rate of convergence, 
63 ± 3 mm yr 1 , agrees well with the rate of 67 mm yr 1 deduced from 
plate tectonics (after Tapley ef a/., 198S). 


2.4.6.1 Satellite laser-ranging 

The accurate tracking of a satellite orbit is achieved by 
satellite laser-ranging (SLR). The spherical surface of the 
target satellite is covered with numerous retro-reflectors. 
A retro-reflector consists of three orthogonal mirrors that 
form the corner of a cube; it reflects an incident beam of 
light back along its path. A brief pulse of laser light with 
a wavelength of 532 nm is sent from the tracking station 
on Earth to the satellite, and the two-way travel-time of 
the reflected pulse is measured. Knowing the speed of 
light, the distance of the satellite from the tracking station 
is obtained. The accuracy of a single range measurement 
is about 1 cm. 

America’s Laser Geodynamics Satellite (LAGEOS 1) 
and France’s Storiette satellite have been tracked for 
many years. LAGEOS 1 flies at 5858-5958 km altitude, 
the inclination of its orbit is 110° (i.e., its orbital sense is 
opposite to the Earth’s rotation), and the nodal line of the 
orbit advances at 0.343° per day. Starlette flies at an alti¬ 
tude of 806-1108 km, its orbit is inclined at 50°, and its 
nodal line regresses at 3.95° per day. 
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Fig. 2.28 The mean sea 
surface as determined from 
SEASAT and GEOS-3 satellite 
altimetry, after removal of 
long-wavelength features of 
the GEM-1 OB geoid up to 
order and degree 12 (from 
Marsh eta/., 1992). The 
surface is portrayed as though 
illuminated from the 
northwest. 



The track of a satellite is perturbed by many factors, 
including the Earth’s gravity field, solar and lunar tidal 
effects, and atmospheric drag. The perturbing influences 
of these factors can be computed and allowed for. For the 
very high accuracy that has now been achieved in SLR 
results, variations in the coordinates of the tracking sta¬ 
tions become detectable. The motion of the pole of rota¬ 
tion of the Earth can be deduced and the history of 
changes in position of the tracking station can be 
obtained. LAGEOS 1 was launched in 1976 and has been 
tracked by more than twenty laser-tracking stations on 
five tectonic plates. The relative changes in position 
between pairs of stations can be compared with the rates 
of plate tectonic motion deduced from marine geophysi¬ 
cal data. For example, a profile from the Yaragadee track¬ 
ing station in Australia and the tracking station in Hawaii 
crosses the converging plate boundary between the Indo- 
Australian and Pacific plates (Fig. 2.27). The results of 
four years of measurement show a decrease of the arc dis¬ 
tance between the two stations at a rate of 63 ± 3 mm yr -1 . 
This is in good agreement with the corresponding rate of 
67 mm yr -1 inferred from the relative rotation of the tec¬ 
tonic plates. 

2.4.62 Satellite altimetry 

From satellite laser-ranging measurements the altitude of 
a spacecraft can be determined relative to the reference 
ellipsoid with a precision in the centimeter range. In satel¬ 
lite altimetry the tracked satellite carries a transmitter and 
receiver of microwave (radar) signals. A brief electromag¬ 
netic pulse is emitted from the spacecraft and reflected 
from the surface of the Earth. The two-way travel-time is 
converted using the speed of light to an estimate of the 
height of the satellite above the Earth’s surface. The 
difference between the satellite’s height above the ellipsoid 
and above the Earth’s surface gives the height of the 
topography relative to the reference ellipsoid. The preci¬ 
sion over land areas is poorer than over the oceans, but 
over smooth land features like deserts and inland water 
bodies an accuracy of better than a meter is achievable. 


Satellite altimeters are best suited for marine surveys, 
where sub-meter accuracy is possible. The satellite 
GEOS-3 flew from 1975-1978, SEASAT was launched in 
1978, and GEOSAT was launched in 1985. Specifically 
designed for marine geophysical studies, these satellite 
altimeters revealed remarkable aspects of the marine 
geoid. The long-wavelength geoid undulations (Fig. 2.25) 
have large amplitudes up to several tens of meters and are 
maintained by mantle-wide convection. The short-wave¬ 
length features are accentuated by removing the com¬ 
puted geoid elevation up to a known order and degree. 
The data are presented in a way that emphasizes the ele¬ 
vated and depressed areas of the sea surface (Fig. 2.28). 

There is a strong correlation between the short-wave¬ 
length anomalies in elevation of the mean sea surface and 
features of the sea-floor topography. Over the ocean ridge 
systems and seamount chains the mean sea surface (geoid) 
is raised. The locations of fracture zones, in which one side 
is elevated relative to the other, are clearly discernible. Very 
dark areas mark the locations of deep ocean trenches, 
because the mass deficiency in a trench depresses the geoid. 
Seaward of the deep ocean trenches the mean sea surface is 
raised as a result of the upward flexure of the lithosphere 
before it plunges downward in a subduction zone. 

2.4.63 Satellite-based global positioning systems (GPS) 

Geodesy, the science of determining the three-dimen¬ 
sional coordinates of a position on the surface of the 
Earth, received an important boost with the advent of 
the satellite era. The first global satellite navigation 
system, the US Navy Navigation Satellite System known 
as TRANSIT consisted of six satellites in polar orbits 
about 1100 km above the surface of the Earth. Signals 
transmitted from these satellites were combined in a 
receiver on Earth with a signal generated at the same fre¬ 
quency in the receiver. Because of the motion of the 
satellite the frequency of its signal was modified by the 
Doppler effect and was thus slightly different from 
the receiver-generated signal, producing a beat fre¬ 
quency. Using the speed of light, the beat signal was 
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Fig. 2.29 Annual 
displacement rates in 
southeastern Italy, the Ionian 
Islands and western Greece 
relative to Matera (Italy), 
determined from GPS surveys 
in 1989 and 1992. The 
displacement arrows are 
much larger than the 
measurement errors, and 
indicate a distinct 
southwestward movement of 
western Greece relative to 
Italy (after Kahleefa/., 1995). 



converted to the oblique distance between the satellite 
and receiver. By integrating the beat signal over a chosen 
time interval the change in range to the satellite in the 
interval was obtained. This was repeated several times. 
The orbit of the satellite was known precisely from track¬ 
ing stations on the ground, and so the position of the 
receiver could be calculated. Originally developed to 
support ballistic missile submarines in the 1960s, the 
system was extended to civilian navigation purposes, 
especially for fixing the position of a ship at sea. The 
TRANSIT program was terminated in 1996, and suc¬ 
ceeded by the more precise GPS program. 

The Navigation Satellite Timing and Ranging Global 
Positioning System (NAVSTAR GPS, or, more com¬ 
monly, just GPS) utilizes satellites in much higher orbits, 
at an altitude of around 20,200 km (i.e., a radial distance 
of 26,570 km), with an orbital period of half a sidereal 
day. The GPS system consists of 24 satellites. There are 
four satellites in each of six orbital planes, equally spaced 
at 60° intervals around the equator and inclined to the 
equator at about 55°. Between five and eight GPS satel¬ 
lites are visible at any time and location on Earth. Each 
satellite broadcasts its own predetermined position and 
reference signal every six seconds. The time difference 
between emission and reception on Earth gives the 
“pseudo-range” of the satellite, so-called because it must 
be corrected for errors in the clock of the receiver and for 
tropospheric refraction. Pseudo-range measurements to 
four or more satellites with known positions allows com¬ 
putation of the clock error and the exact position of the 
receiver. The precision with which a point can be located 


depends on the quality of the receiver and signal process¬ 
ing. Low-cost single civilian-quality receivers have about 
100 m positioning accuracy. In scientific and military 
missions a roving receiver is used in conjunction with 
a base station (a fixed receiver), and differential signal 
processing improves the accuracy of location to around 
1 cm. 

The GPS system allows very precise determination of 
changes in the distance between observation points. For 
example, a dense network of GPS measurements was 
made in southeastern Italy, the Ionian Islands and 
western Greece in 1989 and 1993. The differences between 
the two measuring campaigns show that southwestern 
Greece moved systematically to the southwest relative to 
Matera in southeastern Italy at mean annual rates of 
20-40 mm yr -1 (Fig. 2.29). 

2.4.64 Measurement of gravity and the geoid from orbiting 
satellites 

The equipotential surface of gravity, the geoid (Section 
2.4.5), is characterized by undulations caused by inhomo¬ 
geneous distribution of mass in the Earth. Until recently, 
construction of a global model of the geoid was very 
laborious, as it required combining data from many 
different sources of variable precision. Surface gravity 
measurements made on land or at sea were augmented by 
data from a large number of Earth-orbiting satellites. The 
resulting figure showed large-scale features (Fig. 2.25), 
but fine details were impossible to define accurately. 
Satellites in comparatively low orbits, a few hundreds of 
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kilometers above the Earth’s surface, can now be used in 
conjunction with the GPS satellites orbiting at high alti¬ 
tudes (20,200 km) to measure the global gravity field and 
geoid with a precision that is several orders of magnitude 
better than was previously possible. 

In 2000 the German CHAMP (Challenging Mini-satel¬ 
lite Payload) satellite was inserted into a nearly circular, 
almost polar orbit with an initial altitude of 450 km. At 
this altitude the thin atmosphere is still capable of exerting 
drag, which lowers the altitude of the satellite to about 
300 km over a 5 year interval. Sensitive accelerometers on 
board the satellite allow correction for non-gravitational 
forces, such as atmospheric drag or the pressure of solar 
radiation. A highly precise GPS receiver on board the 
CHAMP satellite, using position data from up to 12 GPS 
satellites simultaneously, allows retrieval of CHAMP’s 
position with an accuracy of a few centimeters. Whereas 
the orbits of earlier satellites were compiled from many 
comparatively short tracks measured when the satellite 
was in view of different ground stations, the CHAMP 
orbit is continuously tracked by the GPS satellites. Small 
perturbations of CHAMP’s orbit may be tracked and 
modelled. The models of the Earth’s gravity field and of 
the global geoid derived from CHAMP data were greatly 
improved in accuracy and definition over previous models. 

Building on the experience gained from CHAMP, a 
joint American-German project, the Gravity Recovery 
and Climate Experiment (GRACE), was launched in 
2002. The GRACE mission uses two nearly identical 
satellites in near-circular polar orbits (inclination 89.5° to 
the equator), initially about 500 km above Earth’s surface. 
The twin satellites each carry on-board GPS receivers, 
which allow precise determination of their absolute posi¬ 
tions over the Earth at any time. The satellites travel in 
tandem in the same orbital plane, separated by approxi¬ 
mately 220 km along their track. Changes in gravity along 
the orbit are determined by observing small differences in 
the separation of the two satellites. This is achieved by 
using a highly accurate microwave ranging system. Each 
satellite carries a microwave antenna transmitting in the 
K-band frequency range (wavelength ~ 1 cm) and 
directed accurately at the other satellite. With this ranging 
system the separation of the two satellites can be mea¬ 
sured with a precision of one micrometer (1 p,m). 

As the satellite-pair orbits the Earth, it traverses varia¬ 
tions in the gravity field due to the inhomogeneous mass 
distribution in the Earth. If there is a mass excess, the 
equipotential surface bulges upward, and gravity is 
enhanced locally. The leading satellite encounters this 
anomaly first and is accelerated away from the trailing 
satellite. Tiny changes in separation between the two satel¬ 
lites as they move along-track are detected by the accurate 
microwave ranging system. In conjunction with exact 
location of the satellite by the on-board GPS devices, the 
GRACE satellites provide fine-scale definition of the 
gravity field, and determination of the geoid from a single 
source. Moreover, the satellites measure the gravity field 


completely in about 30 days. Thus, comparison of data 
from selected surveys of a region can reveal very small, 
time-dependent changes in gravity resulting, for example, 
from transient effects such as changes in groundwater 
level, or the melting of glaciers, in the observed region. 
Other instruments on board the GRACE satellites make 
further observations for atmospheric and ionospheric 
research. 

24.6.5 Observation of crustal deformation with satellite- 
borne radar 

Among the many satellites in Earth orbit, some (identi¬ 
fied by acronyms such as ERS1, ERS2, JERS, IRS, 
RADARSAT, Envisat, etc.) are specifically designed to 
direct beams of radar waves at the Earth and record the 
reflections from the Earth’s surface. Synthetic aperture 
radar (SAR) is a remote sensing technique that has made 
it possible to record features of the Earth’s surface in 
remarkable detail based on these radar reflections. In a 
typical SAR investigation enormous amounts of radar 
data are gathered and subjected to complex data-process- 
ing. This requires massive computational power, and so is 
usually performed on the ground after the survey has 
been carried out. 

Radar signals, like visible light, are subject to reflec¬ 
tion, refraction and diffraction (these phenomena are 
explained in Section 3.6.2 for seismic waves). Diffraction 
(see Fig. 3.55) bends light passing through a lens in such a 
way that a point source becomes diffuse. When two adja¬ 
cent point sources are observed with the lens, their diffuse 
images overlap and if they are very close, they may not be 
seen as distinct points. The resolving power of an optical 
instrument, such as a lens, is defined by the smallest 
angular separation (0) of two points that the instrument 
can distinguish clearly. For a given lens this angle is 
dependent inversely on the diameter (d) of the aperture 
that allows light to pass through the lens, and directly on 
the wavelength (A) of the light. It is given by the approxi¬ 
mate relationship 9~\ld. High resolution requires that 
closely spaced details of the object can be distinguished, 
i.e., the angular resolution 9 should be a small number. 
Thus, the larger the aperture of the lens, the higher is the 
optical resolution. 

The same principle applies to radar. Instead of being 
dependent on the diameter of an optical lens, the resolu¬ 
tion of a radar system is determined by the antenna 
length. When mounted in a satellite, the physical dimen¬ 
sions of the antenna are limited to just a few meters. SAR 
makes use of the motion of the antenna and powerful 
data-processing to get around this limitation. 

The radar antenna is mounted so that it directs its beam 
at right angles to the direction of motion of the host space¬ 
craft. The beam "illuminates” a swathe of ground surface, 
each particle of which reflects a signal to the antenna. 
Hundreds of radar pulses are sent out per second (e.g., the 
European Radar Satellites (ERS) emit 1700 pulses per 
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second); this results in huge amounts of reflected signals. 
As the craft moves forward, the illuminated swathe moves 
across the target surface. Each particle of the target reflects 
hundreds of radar pulses from the time when it is first ener¬ 
gized until it is no longer covered by the beam. During this 
time the craft (and real antenna) move some distance along 
the track. In subsequent data-processing the signals 
reflected from the target are combined and corrected for 
the changing position of the antenna in such a way that 
they appear to have been gathered by an antenna as long as 
the distance moved along the track. This distance is called 
the synthetic aperture of the radar. For example, a SAR 
investigation with an ERS1 satellite in orbit 800 km above 
the Earth’s surface created a synthetic aperture of about 
4 km. The high resolving power achieved with this large 
aperture produced SAR images of ground features with a 
resolution of about 30 m. 

An important aspect of the data reduction is the 
ability to reconstruct the path of each reflection precisely. 
This is achieved using the Doppler effect, the principle of 
which is described in Box 1.2. Reflections from target 
features ahead of the moving spacecraft have elevated 
frequencies; those from behind have lowered frequencies. 
Correcting the frequency of each signal for its Doppler 
shift is necessary to obtain the true geometry of the 
reflections. 

A further development of the SAR method is 
Interferometric SAR (InSAR). This technique analyzes 
the phase of the reflected radar signal to determine small 
changes in topography between repeated passages of the 
satellite over an area. The phase of the wave is a measure 
of the time-delay the wave experiences in transit between 
transmitter and receiver. To illustrate this point, picture 
the shape of a waveform as an alternation of crests and 
troughs, which leaves the satellite at the instant its ampli¬ 
tude is maximum (i.e., at a crest). If the reflected signal 
returns to the satellite as a crest, it has the same phase as 
the transmitted signal. Its amplitude could be expressed 
by the equation y = A cos cot. This will be the case if the 
path of the reflected wave to and from the target is an 
exact number of complete wavelengths. On the other 
hand, if the reflection arrives back at the satellite as a 
trough, it is exactly out-of-phase with the original wave. 
This happens when the length of its path is an odd 
number of half-wavelengths. More generally, the path 
length is not an exact even or odd number of half¬ 
wavelengths, and the equation of the reflected wave must 
be written y = A cos(cot + A), where the phase difference A 
depends on the path length. The InSAR technique devel¬ 
oped in the 1990s is based on analysis of the phases inher¬ 
ent in each reflection recorded by the satellite. 

If a SAR image is made of a target area during one 
orbit, it should be reproduced exactly on a later orbit that 
revisits the same location (this is not exactly possible, but 
paths that repeat within a few hundred meters can be cor¬ 
rected for geometric differences). In particular, because 
each point of the target is the same distance from the trans- 



Fig. 2.30 Interferometric Synthetic Aperture Radar (InSAR) pattern of 
interference fringes showing changes in elevation of Mount Etna, Sicily, 
following the 1992-1993 eruptive cycle. The four pairs of light and dark 
fringes correspond to subsidence of the mountaintop by about 11 cm as 
magma drains out of the volcano (after Massonnet, 1997). 

mitter, the phases of the imaged signals should be identi¬ 
cal. However, if geological events have caused surface dis¬ 
placements between the times of the two images there will 
be phase differences between the two images. These are 
made visible by combining the two images so that they 
interfere with each other. 

When harmonic signals with different phases are 
mixed, they interfere with each other. Constructive inter¬ 
ference occurs when the signals have the same phase; if 
they are superposed, the combined signal is strengthened. 
Destructive interference occurs when out-of-phase 
signals are mixed; the combined signal is weakened. The 
interference pattern that results from mixing the two 
waveforms consists of alternating zones of reinforcement 
and reduction of the signal, forming a sequence of so- 
called “interference fringes.” The use of color greatly 
enhances the visual impact of the interference fringes. 

When this procedure is carried out with SAR images, 
the resulting interference pattern makes it possible to 
interpret ground motion of large areas in much greater 
detail than would be possible from ground-based obser¬ 
vations. The method has been used to record various 
large-scale ground displacements related, for example, to 
earthquakes, tectonic faulting, and volcanism. Figure 
2.30 shows an interference pattern superposed on the 
background topography of Mount Etna, in Sicily, follow¬ 
ing a cycle of eruptions in 1992 and 1993. Successive 
radar images from a common vantage point were 
obtained 13 months apart by the ERS1 satellite, which 
transmitted radar signals with wavelength 5.66 cm. In 
order to change the along-path distance to and from the 
target by a full wavelength, the ground must move by a 
half-wavelength perpendicular to the path, in this case by 
2.83 cm. The concentric dark and light fringes around the 
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Fig. 2.31 Fine-scale fluctuations in the LOD observed by VLBI, and LOD 
variations expected from changes in the angular momentum of the 
atmosphere (after Carter, 1989). 

crater show four cycles of interference, corresponding to 
a change in elevation of the mountaintop of about 11 cm. 
The fringes result from the subsidence of the crater as 
magma drained out of it following the eruptive cycle. 

24.6.6 Very long baseline interferometry 

Extra-galactic radio sources (quasars) form the most 
stable inertial coordinate system yet known for geodetic 
measurements. The extra-galactic radio signals are 
detected almost simultaneously by radio-astronomy 
antennas at observatories on different continents. Know¬ 
ing the direction of the incoming signal, the small differ¬ 
ences in times of arrival of the signal wavefronts at the 
various stations are processed to give the lengths of the 
baselines between pairs of stations. This highly precise 
geodetic technique, called Very Long Baseline Inter¬ 
ferometry (VLBI), allows determination of the separa¬ 
tion of observatories several thousand kilometers apart 
with an accuracy of a few centimeters. Although not 
strictly a satellite-based technique, it is included in this 
section because of its use of non-terrestrial signals for 
high resolution geodetic measurements. 

By combining VLBI observations from different sta¬ 
tions the orientation of the Earth to the extra-galactic 
inertial coordinate system of the radio sources is 
obtained. Repeated determinations yield a record of the 
Earth’s orientation and rotational rate with unprece¬ 
dented accuracy. Motion of the rotation axis (e.g., the 
Chandler wobble, Section 2.3.4.3) can be described opti¬ 
cally with a resolution of 0.5-1 m; the VLBI data have an 
accuracy of 3-5 cm. The period of angular rotation can 
be determined to better than 0.1 millisecond. This has 
enabled very accurate observation of irregularities in the 
rotational rate of the Earth, which are manifest as 
changes in the length of the day (LOD). 

The most important, first-order changes in the LOD are 
due to the braking of the Earth’s rotation by the lunar and 
solar marine tides (Section 2.3.4.1). The most significant 



Fig. 2.32 High-frequency changes in the LOD after correction for the 
effects due to atmospheric angular momentum (points) and the 
theoretical variations expected from the solid bodily Earth-tides (after 
Carter, 1989). 

non-tidal LOD variations are associated with changes in 
the angular momentum of the atmosphere due to shifts in 
the east-west component of the wind patterns. To conserve 
the total angular momentum of the Earth a change in the 
angular momentum of the atmosphere must be compen¬ 
sated by an equal and opposite change in the angular 
momentum of the crust and mantle. The largely seasonal 
transfers of angular momentum correlate well with high- 
frequency variations in the LOD obtained from VLBI 
results (Fig. 2.31). 

If the effects of marine tidal braking and non-tidal 
transfers of atmospheric angular momentum variations are 
taken into account, small residual deviations in the LOD 
remain. These are related to the tides in the solid Earth 
(Section 2.3.3.5). The lunar and solar tidal forces deform 
the Earth elastically and change its ellipticity slightly. The 
readjustment of the mass distribution necessitates a corre¬ 
sponding change in the Earth’s rate of rotation in order to 
conserve angular momentum. The expected changes in 
LOD due to the influence of tides in the solid Earth can be 
computed. The discrepancies in LOD values determined 
from VLBI results agree well with the fluctuations pre¬ 
dicted by the theory of the bodily Earth-tides (Fig. 2.32). 

2.5 GRAVITY ANOMALIES 

2.5.1 Introduction 

The mean value of gravity at the surface of the Earth is 
approximately 9.80ms -2 , or 980,000 mgal. The Earth’s 
rotation and flattening cause gravity to increase by roughly 
5300 mgal from equator to pole, which is a variation of 
only about 0.5%. Accordingly, measurements of gravity 
are of two types. The first corresponds to determination of 
the absolute magnitude of gravity at any place; the second 
consists of measuring the change in gravity from one place 
to another. In geophysical studies, especially in gravity 
prospecting, it is necessary to measure accurately the small 
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changes in gravity caused by underground structures. 
These require an instrumental sensitivity of the order of 
0.01 mgal. It is very difficult to design an instrument to 
measure the absolute value of gravity that has this high 
precision and that is also portable enough to be used easily 
in different places. Gravity surveying is usually carried out 
with a portable instrument called a gravimeter , which 
determines the variation of gravity relative to one or more 
reference locations. In national gravity surveys the relative 
variations determined with a gravimeter may be converted 
to absolute values by calibration with absolute measure¬ 
ments made at selected stations. 


2.5.2 Absolute measurement of gravity 

The classical method of measuring gravity is with a pen¬ 
dulum. A simple pendulum consists of a heavy weight 
suspended at the end of a thin fiber. The compound (or 
reversible) pendulum, first described by Henry Kater in 
1818, allows more exact measurements. It consists of a 
stiff metal or quartz rod, about 50 cm long, to which is 
attached a movable mass. Near each end of the rod is 
fixed a pivot, which consists of a quartz knife-edge resting 
on a flat quartz plane. The period of the pendulum is 
measured for oscillations about one of the pivots. The 
pendulum is then inverted and its period about the other 
pivot is determined. The position of the movable mass is 
adjusted until the periods about the two pivots are equal. 
The distance L between the pivots is then measured accu¬ 
rately. The period of the instrument is given by 



where / is the moment of inertia of the pendulum about a 
pivot, h is the distance of the center of mass from the 
pivot, and m is the mass of the pendulum. Knowing the 
length L from Kater's method obviates knowledge of /, m 
and h. 

The sensitivity of the compound pendulum is found 
by differentiating Eq. (2.60). This gives 


A g = A T 
g T 


(2.61) 


To obtain a sensitivity of about 1 mgal it is necessary to 
determine the period with an accuracy of about 0.5 p,s. This 
can be achieved easily today with precise atomic clocks. The 
compound pendulum was the main instrument for gravity 
prospecting in the 1930s, when timing the swings precisely 
was more difficult. It was necessary to time as accurately as 
possible a very large number of swings. As a result a single 
gravity measurement took about half an hour. 

The performance of the instrument was handicapped 
by several factors. The inertial reaction of the housing to 
the swinging mass of the pendulum was compensated by 
mounting two pendulums on the same frame and swing¬ 
ing them in opposite phase. Air resistance was reduced by 


housing the pendulum assemblage in an evacuated ther¬ 
mostatically controlled chamber. Friction in the pivot was 
minimized by the quartz knife-edge and plane, but due to 
minor unevenness the contact edge was not exactly repeat- 
able if the assemblage was set up in a different location, 
which affected the reliability of the measurements. The 
apparatus was bulky but was used until the 1950s as the 
main method of making absolute gravity measurements. 

25.2.1 Free-fall method 

Modern methods of determining the acceleration of 
gravity are based on observations of falling objects. For 
an object that falls from a starting position : Q with initial 
velocity u the equation of motion gives the position z at 
time t as 

z = z 0 + ut + ^ gt 2 (2.62) 

The absolute value of gravity is obtained by fitting a qua¬ 
dratic to the record of position versus time. 

An important element in modern experiments is the 
accurate measurement of the change of position with a 
Michelson interferometer. In this device a beam of mono¬ 
chromatic light passes through a beam splitter, consisting 
of a semi-silvered mirror, which reflects half of the light 
incident upon it and transmits the other half. This divides 
the incident ray into two subrays, which subsequently 
travel along different paths and are then recombined to 
give an interference pattern. If the path lengths differ by a 
full wavelength (or a number of full wavelengths) of the 
monochromatic light, the interference is constructive. The 
recombined light has maximum intensity, giving a bright 
interference fringe. If the path lengths differ by half a 
wavelength (or by an odd number of half-wavelengths) the 
recombined beams interfere destructively, giving a dark 
fringe. In modern experiments the monochromatic light 
source is a laser beam of accurately known wavelength. 

In an absolute measurement of gravity a laser beam is 
split along two paths that form a Michelson interferome¬ 
ter (Fig. 2.33). The horizontal path is of fixed length, 
while the vertical path is reflected off a corner-cube retro- 
reflector that is released at a known instant and falls 
freely. The path of free-fall is about 0.5 m long. The cube 
falls in an evacuated chamber to minimize air resistance. 
A photo-multiplier and counter permit the fringes within 
any time interval to be recorded and counted. The inten¬ 
sity of the recombined light fluctuates sinusoidally with 
increasing frequency the further and faster the cube falls. 
The distance between each zero crossing corresponds to 
half the wavelength of the laser light, and so the distance 
travelled by the falling cube in any time interval may be 
obtained. The times of the zero crossings must be mea¬ 
sured with an accuracy of 0.1 ns (10 -10 s) to give an accu¬ 
racy of 1 p-gal in the gravity measurement. 

Although the apparatus is compact, it is not quite 
portable enough for gravity surveying. It gives measure- 
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Fig. 2.33 The free-fall 
method of measuring 
absolute gravity. 
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Fig. 2.34 The rise-and-fall method of measuring absolute gravity. 

merits of the absolute value of gravity with an accuracy of 
about 0.005-0.010 mgal (5-10 jjtgal). A disadvantage of 
the free-fall method is the resistance of the residual air 
molecules left in the evacuated chamber. This effect is 
reduced by placing the retro-reflector in a chamber that 
falls simultaneously with the cube, so that in effect the 
cube falls in still air. Air resistance is further reduced in 
the rise-and-fall method. 

252.2 Rise-and-fall method 

In the original version of the rise-and-fall method a glass 
sphere was fired vertically upward and fell back along the 
same path (Fig. 2.34). Timing devices at two different levels 


registered the times of passage of the ball on the upward 
and downward paths. In each timer a light beam passed 
through a narrow slit. As the glass sphere passed the slit it 
acted as a lens and focussed one slit on the other. A photo¬ 
multiplier and detector registered the exact passage of the 
ball past the timing level on the upward and downward 
paths. The distance h between the two timing levels (around 
1 m) was measured accurately by optical interferometry. 

Let the time spent by the sphere above the first timing 
level be T l and the time above the second level be T 2 ; 
further, let the distances from the zenith level to the 
timing levels be z { and z 2 , respectively. The corresponding 
times of fall are t l = Tj/2 and t-, = TJ2. Then, 

< 2 -«> 

with a similar expression for the second timing level. 
Their separation is 

h = : \- : 2 = \g( T \- T 2) (2.64) 

The following elegantly simple expression for the value of 
gravity is obtained: 

8 ( T\ - T\) (2 ' 65) 

Although the experiment is conducted in a high 
vacuum, the few remaining air molecules cause a drag 
that opposes the motion. On the upward path the air drag 
is downward, in the same direction as gravity; on the 
downward path the air drag is upward, opposite to the 
direction of gravity. This asymmetry helps to minimize 
the effects of air resistance. 

In a modern variation Michelson interferometry is used 
as in the free-fall method. The projectile is a corner-cube 
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Fig. 2.35 The principle of operation of an unstable (astatic) type of 
gravimeter. 

retro-reflector, and interference fringes are observed and 
counted during its upward and downward paths. Sensi¬ 
tivity and accuracy are comparable to those of the free-fall 
method. 

2.5.3 Relative measurement of gravity: the gravimeter 

In principle, a gravity meter, or gravimeter , is a very sensi¬ 
tive balance. The first gravimeters were based on the 
straightforward application of Hooke’s law (Section 
3.2.1). A mass m suspended from a spring of length s 0 
causes it to stretch to a new length s. The extension, or 
change in length, of the spring is proportional to the 
restoring force of the spring and so to the value of gravity, 
according to: 

F= mg = — k(s — ,s' 0 ) (2.66) 

where k is the elastic constant of the spring. The gravime¬ 
ter is calibrated at a known location. If gravity is different 
at another location, the extension of the spring changes, 
and from this the change in gravity can be computed. 

This type of gravimeter, based directly on Hooke’s law, 
is called a stable type. It has been replaced by more sensi¬ 
tive unstable or astatized types, which are constructed so 
that an additional force acts in the same direction as 
gravity and opposes the restoring force of the spring. The 
instrument is then in a state of unstable equilibrium. This 



Fig. 2.36 Compensation of gravity readings for instrumental drift. The 
gravity stations B-T are occupied in sequence at known times. The 
repeated measurements at the base station B allow a drift correction to 
be made to the gravity readings at the other stations. 

condition is realized through the design of the spring. If 
the natural length 5 0 can be made as small as possible, 
ideally zero, Eq. (2.66) shows that the restoring force is 
then proportional to the physical length of the spring 
instead of its extension. The zero-length spring , first intro¬ 
duced in the LaCoste-Romberg gravimeter, is now a 
common element in modern gravimeters. The spring is 
usually of the helical type. When a helical spring is 
stretched, the fiber of the spring is twisted; the total twist 
along the length of the fiber equals the extension of the 
spring as a whole. During manufacture of a zero-length 
spring the helical spring is given an extra twist, so that its 
tendency is to uncoil. An increase in gravity stretches the 
spring against its restoring force, and the extension is aug¬ 
mented by the built-in pre-tension. 

The operation of a gravimeter is illustrated in Fig. 
2.35. A mass is supported by a horizontal rod to which a 
mirror is attached. The position of the rod is observed 
with a light-beam reflected into a microscope. If gravity 
changes, the zero-length spring is extended or shortened 
and the position of the rod is altered, which deflects the 
light-beam. The null-deflection principle is utilized. An 
adjusting screw changes the position of the upper attach¬ 
ment of the spring, which alters its tension and restores 
the rod to its original horizontal position as detected by 
the light-beam and microscope. The turns of the adjust¬ 
ing screw are calibrated in units of the change in gravity, 
usually in mgal. 

The gravimeter is light, robust and portable. After ini¬ 
tially levelling the instrument, an accurate measurement 
of a gravity difference can be made in a few minutes. The 
gravimeter has a sensitivity of about 0.01 mgal (10 p,gal). 
This high sensitivity makes it susceptible to small changes 
in its own properties. 
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2.53.1 Gravity surveying 

If a gravimeter is set up at a given place and monitored for 
an hour or so, the repeated readings are found to vary 
smoothly with time. The changes amount to several hun¬ 
dredths of a mgal. The instrumental drift is partly due to 
thermally induced changes in the elastic properties of the 
gravimeter spring, which are minimized by housing the 
critical elements in an evacuated chamber. In addition, 
the elastic properties of the spring are not perfect, but 
creep slowly with time. The effect is small in modern 
gravimeters and can be compensated by making a drift 
correction. This is obtained by repeated occupation of 
some measurement stations at intervals during the day 
(Fig. 2.36). Gravity readings at other stations are adjusted 
by comparison with the drift curve. In order to make this 
correction the time of each measurement must be noted. 

During the day, while measurements are being made, 
the gravimeter is subject to tidal attraction, including 
vertical displacement due to the bodily Earth-tides. The 
theory of the tides is known well (see Section 2.3.3) and 
their time-dependent effect on gravity can be computed 
precisely for any place on Earth at any time. Again, the 
tidal correction requires that the time of each measure¬ 
ment be known. 

The goal of gravity surveying is to locate and describe 
subsurface structures from the gravity effects caused by 
their anomalous densities. Most commonly, gravimeter 
measurements are made at a network of stations, spaced 
according to the purpose of the survey. In environmental 
studies a detailed high-resolution investigation of the 
gravity expression of a small area requires small distances 
of a few meters between measurement stations. In 
regional gravity surveys, as used for the definition of 
hidden structures of prospective commercial interest, the 
distance between stations may be several kilometers. If 
the area surveyed is not too large, a suitable site is selected 
as base station (or reference site), and the gravity 
differences between the surveyed sites and this site are 
measured. In a gravity survey on a national scale, the 
gravity differences may be determined relative to a site 
where the absolute value of gravity is known. 

2.5.4 Correction of gravity measurements 

If the interior of the Earth were uniform, the value of 
gravity on the international reference ellipsoid would vary 
with latitude according to the normal gravity formula 
(Eq. (2.56)). This provides us with a reference value for 
gravity measurements. In practice, it is not possible to 
measure gravity on the ellipsoid at the place where the ref¬ 
erence value is known. The elevation of a measurement 
station may be hundreds of meters above or below the 
ellipsoid. Moreover, the gravity station may be sur¬ 
rounded by mountains and valleys that perturb the mea¬ 
surement. For example, let P and Q represent gravity 
stations at different elevations in hilly terrain (Fig. 2.37a). 
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Fig. 2.37 After (a) terrain corrections, (b) the Bouguer plate correction, 
and (c) the free-air correction, the gravity measurements at stations P 
and Q can be compared to (d) the theoretical gravity at R on the 
reference ellipsoid. 


The theoretical value of gravity is computed at the points 
R on the reference ellipsoid below P and Q. Thus, we must 
correct the measured gravity before it can be compared 
with the reference value. 

The hill-top adjacent to stations P and Q has a center 
of mass that lies higher than the measurement elevation 
(Fig. 2.37a). The gravimeter measures gravity in the ver¬ 
tical direction, along the local plumb-line. The mass of 
the hill-top above P attracts the gravimeter and causes 
an acceleration with a vertically upward component at P. 
The measured gravity is reduced by the presence of the 
hill-top; to compensate for this a terrain (or topographic) 
correction is calculated and added to the measured 
gravity. A similar effect is observed at Q, but the hill-top 
above Q is smaller and the corresponding terrain correc¬ 
tion is smaller. These corrections effectively level the 
topography to the same elevation as the gravity station. 
The presence of a valley next to each measurement 
station also requires a terrain correction. In this case, 
imagine that we could fill the valley up to the level of 
each station with rock of the same density p as under P 
and Q. The downward attraction on the gravimeter 
would be increased, so the terrain correction for a valley 
must also be added to the measured gravity, just as for a 
hill. Removing the effects of the topography around a 
gravity station requires making positive terrain correc¬ 
tions (Ag x ) for both hills and valleys. 

After levelling the topography there is now a Active 
uniform layer of rock with density p between the gravity 
station and the reference ellipsoid (Fig. 2.37b). The gravi¬ 
tational acceleration of this rock-mass is included in the 



78 


Gravity, the figure of the Earth and geodynamics 


measured gravity and must be removed before we can 
compare with the theoretical gravity. The layer is taken to 
be a flat disk or plate of thickness /z p or /z Q under each 
station; it is called the Bouguer plate. Its gravitational 
acceleration can be computed for known thickness and 
density p, and gives a Bouguer plate correction (Ag Bp ) that 
must be subtracted from the measured gravity, if the 
gravity station is above sea-level. Note that, if the gravity 
station is below sea-level, we have to fill the space above it 
up to sea-level with rock of density p; this requires increas¬ 
ing the measured gravity correspondingly. The Bouguer 
plate correction (Ag Bp ) is negative if the station is above 
sea-level but positive if it is below sea-level. Its size 
depends on the density of the local rocks, but typically 
amounts to about 0.1 mgal m -1 . 

Finally, we must compensate the measured gravity for 
the elevation /z p or /z Q of the gravity station above the 
ellipsoid (Fig. 2.37c). The main part of gravity is due to 
gravitational attraction, which decreases proportionately 
to the inverse square of distance from the center of the 
Earth. The gravity measured at P or Q is smaller than it 
would be if measured on the ellipsoid at R. A free-air cor¬ 
rection (Ag FA ) for the elevation of the station must be 
added to the measured gravity. This correction ignores 
the effects of material between the measurement and ref¬ 
erence levels, as this is taken care of in Ag Bp . Note that, if 
the gravity station were below sea-level, the gravitational 
part of the measured gravity would be too large by com¬ 
parison with the reference ellipsoid; we would need 
to subtract Ag FA in this case. The free-air correction is 
positive if the station is above sea-level but negative if it is 
below sea-level (as might be the case in Death Valley or 
beside the Dead Sea). It amounts to about 0.3 mgal m -1 . 

The free-air correction is always of opposite sense to 
the Bouguer plate correction. For convenience, the two 
are often combined in a single elevation correction, which 
amounts to about 0.2 mgal m -1 . This must be added for 
gravity stations above sea-level and subtracted if gravity 
is measured below sea-level. In addition, a tidal correction 
(A&tide) must be made (Section 2.3.3), and, if gravity is 
measured in a moving vehicle, the Eotvos correction 
(Section 2.3.5) is also necessary. 

After correction the measured gravity can be com¬ 
pared with the theoretical gravity on the ellipsoid (Fig. 
2.37d). Note that the above procedure reduces the mea¬ 
sured gravity to the surface of the ellipsoid. In principle it 
is equally valid to correct the theoretical gravity from the 
ellipsoid upward to the level where the measurement was 
made. This method is preferred in more advanced types of 
analysis of gravity anomalies where the possibility of an 
anomalous mass between the ellipsoid and ground 
surface must be taken into account. 

2.5.4.1 Latitude correction 

The theoretical gravity at a given latitude is given by the 
normal gravity formula (Eq. 2.56). If the measured 




Fig. 2.38 Terrain corrections Ag T are made by (a) dividing the 
topography into vertical elements, (b) computing the correction for 
each cylindrical element according to its height above or below the 
measurement station, and (c) adding up the contributions for all 
elements around the station with the aid of a transparent overlay on a 
topographic map. 


gravity is an absolute value, the correction for latitude is 
made by subtracting the value predicted by this formula. 
Often, however, the gravity survey is made with a 
gravimeter, and the quantity measured, g m , is the gravity 
difference relative to a base station. The normal reference 
gravity g n may then be replaced by a latitude correction, 
obtained by differentiating Eq. (2.56): 

= g e (Pi sin2A + /3 2 sin4A) (2.67) 

After converting dA from radians to kilometers and 
neglecting the /3 2 term, the latitude correction (Ag lat ) is 
0.8140sin2A mgal per kilometer of north-south displace¬ 
ment. Because gravity decreases towards the poles, the 
correction for stations closer to the pole than the base 
station must be added to the measured gravity. 









2.5 GRAVITY ANOMALIES 


79 


2.54.2 Terrain corrections 


The terrain correction (Ag x ) for a hill adjacent to a gravity 
station is computed by dividing the hill into a number of 
vertical prisms (Fig. 2.38a). The contribution of each ver¬ 
tical element to the vertical acceleration at the point of 
observation P is calculated by assuming cylindrical sym¬ 
metry about P. The height of the prism is /;, its inner and 
outer radii are r x and r 2 , respectively, the angle subtended 
at P is cf) 0 , and the density of the hill is p (Fig. 2.38b). Let 
the sides of a small cylindrical element be dr, dr and r d<j>; 
its mass is dm = p r d<f> dr dr and its contribution to the 
upward acceleration caused by the prism at P is 


Ag=G 


dm 


(r 2 + r 2 ) 


cos# = G 


prdrd:d(f> 


(r 2 + r 2 ) ^(r 2 + z 2 ) 


( 2 . 68 ) 


Combining and rearranging terms and the order of inte¬ 
gration gives the upward acceleration at P due to the 
cylindrical prism: 


Ag x = Gpfdd>f(f (r2 ; d ; 2) 3/ 2 )'-dr (2.69) 

<t> = o r—r x Z= 0 

The integration over cj) gives </> o ; after further integration 
over r we get: 

^’ a <*\£{vwwr'Y' (r ™> 


mean topographic relief within each sector changes and 
must be computed anew. As a result, terrain corrections 
are time consuming and tedious. The most important 
effects come from the topography nearest to the station. 
Flowever, terrain corrections are generally necessary if a 
topographic difference within a sector is more than about 
5% of its distance from the station. 

2.54.3 Bouguerplate correction 

The Bouguer plate correction (Ag Bp ) compensates for the 
effect of a layer of rock whose thickness corresponds to 
the elevation difference between the measurement and 
reference levels. This is modelled by a solid disk of density 
p and infinite radius centered at the gravity station P. The 
correction is computed by extension of the calculation for 
the terrain correction. An elemental cylindrical prism is 
defined as in Fig. 2.38b. Let the angle (/> subtended by the 
prism increase to 2 tt and the inner radius decrease to 
zero; the first term in brackets in Eq. (2.71) reduces to /?. 
The gravitational acceleration at the center of a solid disk 
of radius r is then 

Ag x = 2ttGp( h — ( yjr 2 + h 2 — r )) (2.72) 

Now let the radius r of the disk increase. The value of h 
gradually becomes insignificant compared to r; in the limit, 
when r is infinite, the second term in Eq. (2.72) tends to 
zero. Thus, the Bouguer plate correction (Ag Bp ) is given by 


Integration over r gives the upward acceleration produced 
at P by the cylinder: 

A g T = Gpcf> 0 ((-\lr 2 + h 2 — rj — (s/r 2 + h 2 — r 7 )) (2.71) 

The direction of Ag x in Fig. 2.38b is upward, opposite to 
gravity; the corresponding terrain correction must be 
added to the measured gravity. 

In practice, terrain corrections can be made using a 
terrain chart (Fig. 2.38c) on which concentric circles and 
radial lines divide the area around the gravity station into 
sectors that have radial symmetry like the cross-section of 
the element of a vertical cylinder in Fig. 2.38b. The inner 
and outer radii of each sector correspond to r l and r 2 , 
and the angle subtended by the sector is <j>. The terrain 
correction for each sector within each zone is pre-calcu- 
lated using Eq. (2.71) and tabulated. The chart is drawn 
on a transparent sheet that is overlaid on a topographic 
map at the same scale and centered on the gravity station. 
The mean elevation within each sector is estimated as 
accurately as possible, and the elevation difference (i.e., h 
in Eq. (2.71)) of the sector relative to the station is com¬ 
puted. This is multiplied by the correction factor for the 
sector to give its contribution to the terrain correction. 
Finally, the terrain correction at the gravity station is 
obtained by summing up the contributions of all sectors. 
The procedure must be repeated for each gravity station. 
When the terrain chart is centered on a new station, the 


Ag Bp = 2'nGpli (2.73) 

Inserting numerical values gives 0.0419 X 10 -3 p mgal m -1 
for Ag Bp , where the density p is in kgm -3 (see Section 
2.5.5). The correct choice of density is very important in 
computing Ag Bp and Ag x . Some methods of determining 
the optimum choice are described in detail below. 

An additional consideration is necessary in marine 
gravity surveys. Ag Bp requires uniform density below the 
surface of the reference ellipsoid. To compute Ag Bp over an 
oceanic region we must in effect replace the sea-water with 
rock of density p. However, the measured gravity contains 
a component due to the attraction of the sea-water (density 
1030 kgm -3 ) in the ocean basin. The Bouguer plate correc¬ 
tion in marine gravity surveys is therefore made by replac¬ 
ing the density p in Eq. (2.73) by (p - 1030) kgm -3 . When a 
shipboard gravity survey is made over a large deep lake, a 
similar allowance must be made for the depth of water in 
the lake using an assumed density of (p - 1000) kgm -3 . 

2.544 Free-air correction 

The free-air correction (Ag FA ) has a rather colorful, but 
slightly misleading title, giving the impression that the 
measurement station is floating in air above the ellip¬ 
soid. The density of air at standard temperature and 
pressure is around 1.3 kgm -3 and a mass of air between 
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the observation and reference levels would cause a 
detectable gravity effect of about 50 |xgal at an elevation 
of 1000 m. In fact, the free-air correction pays no atten¬ 
tion to the density of the material between the measure¬ 
ment elevation and the ellipsoid. It is a straightforward 
correction for the decrease of gravitational acceleration 
with distance from the center of the Earth: 


dg 

dr 



(2.74) 


On substituting the Earth’s radius (6371 km) for r and 
the mean value of gravity (981,000mgal) for g, the value 
of Ag FA is found to be 0.3086 mgal m -1 . 


25.4.5 Combined elevation correction 

The free-air and Bouguer plate corrections are often com¬ 
bined into a single elevation correction, which is (0.3086 
- (0.0419pX 10 -3 )) mgal m _1 . Substituting a typical 
density for crustal rocks, usually taken to be 2670 kgm” 3 , 
gives a combined elevation correction of 0.197 mgal m -1 . 
This must be added to the measured gravity if the gravity 
station is above the ellipsoid and subtracted if it is below. 

The high sensitivity of modern gravimeters allows an 
achievable accuracy of 0.01-0.02 mgal in modern gravity 
surveys. To achieve this accuracy the corrections for the 
variations of gravity with latitude and elevation must be 
made very exactly. This requires that the precise coordi¬ 
nates of a gravity station must be determined by accurate 
geodetic surveying. The necessary precision of horizontal 
positioning is indicated by the latitude correction. This is 
maximum at 45° latitude, where, in order to achieve a 
survey accuracy of± 0.01 mgal, the north-south posi¬ 
tions of gravity stations must be known to about ±10m. 
The requisite precision in vertical positioning is indicated 
by the combined elevation correction of 0.2 mgal m -1 . To 
achieve a survey accuracy of ±0.01 mgal the elevation of 
the gravimeter above the reference ellipsoid must be 
known to about ±5 cm. 

The elevation of a site above the ellipsoid is often 
taken to be its altitude above mean sea-level. However, 
mean sea-level is equated with the geoid and not with the 
ellipsoid. Geoid undulations can amount to tens of 
meters (Section 2.4.5.1). They are long-wavelength fea¬ 
tures. Within a local survey the distance between geoid 
and ellipsoid is unlikely to vary much, and the gravity 
differences from the selected base station are unlikely to 
be strongly affected. In a national survey the discrepan¬ 
cies due to geoid undulations may be more serious. In the 
event that geoid undulations are large enough to affect a 
survey, the station altitudes must be corrected to true ele¬ 
vations above the ellipsoid. 


2.5.5 Density determination 

The density of rocks in the vicinity of a gravity profile is 
important for the calculation of the Bouguer plate and 
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Fig. 2.39 Typical mean values and ranges of density for some common 
rock types (data source: Dobrin, 1976). 


terrain corrections. Density is defined as the mass per unit 
of volume of a material. It has different units and 
different numerical values in the c.g.s. and SI systems. For 
example, the density of water is 1 g cm -3 in the c.g.s. 
system, but 1000 kgm -3 in the SI system. In gravity 
prospecting c.g.s. units are still in common use, but are 
slowly being replaced by SI units. The formulas given for 
Ag x and Ag Bp in Eq. (2.71) and Eq. (2.73), respectively, 
require that density be given in kgm -3 . 

A simple way of determining the appropriate density to 
use in a gravity study is to make a representative collection 
of rock samples with the aid of a geological map. The spe¬ 
cific gravity of a sample may be found directly by weighing 
it first in air and then in water, and applying Archimedes' 
principle. This gives its density p r relative to that of water: 

W. 

Pr~ \y —w (2.75) 

a w 

Typically, the densities found for different rock types 
by this method show a large amount of scatter about 
their means, and the ranges of values for different rock 
types overlap (Fig. 2.39). The densities of igneous and 
metamorphic rocks are generally higher than those of 
sedimentary rocks. This method is adequate for recon¬ 
naissance of an area. Unfortunately, it is often difficult to 
ensure that the surface collection of rocks is representa¬ 
tive of the rock types in subsurface structures, so alterna¬ 
tive methods of determining the appropriate density are 
usually employed. Density can be measured in vertical 
boreholes, drilled to explore the nature of a presumed 
structure. The density determined in the borehole is used 
to refine the interpretation of the structure. 
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Fig. 2.40 The empirical relationships between density and the seismic P- 
wave and S-wave velocities in water-saturated sediments and 
sedimentary rocks, igneous and metamorphic rocks (after Ludwig etal., 
1970). 

2.5.5.1 Density from seismic velocities 

Measurements on samples of water-saturated sediments 
and sedimentary rocks, and on igneous and metamorphic 
rocks show that density and the seismic P-wave and S-wave 
velocities are related. The optimum fit to each data-set is a 
smooth curve (Fig. 2.40). Each curve is rather idealized, as 
the real data contain considerable scatter. For this reason 
the curves are best suited for computing the mean density 
of a large crustal body from its mean seismic velocity. 
Adjustments must be made for the higher temperatures 
and pressures at depth in the Earth, which affect both the 
density and the elastic parameters of rocks. Flowever, the 
effects of high pressure and temperature can only be exam¬ 
ined in laboratory experiments on small specimens. It is not 
known to what extent the results are representative of the 
in situ velocity-density relationship in large crustal blocks. 

The velocity-density curves are empirical relation¬ 
ships that do not have a theoretical basis. The P-wave data 
are used most commonly. In conjunction with seismic 
refraction studies, they have been used for modelling the 
density distributions in the Earth’s crust and upper 
mantle responsible for large-scale, regional gravity anom¬ 
alies (see Section 2.6.4). 


2.5.52 Gamma-gamma logging 

The density of rock formations adjacent to a borehole can 
be determined from an instrument in the borehole. The 
principle makes use of the Compton scattering of y-rays by 
loosely bound electrons in the rock adjacent to a borehole. 
An American physicist, Arthur H. Compton, discovered in 
1923 that radiation scattered by loosely bound electrons 
experienced an increase in wavelength. This simple obser¬ 
vation cannot be explained at all if the radiation is treated 
as a wave; the scattered radiation would have the same 
wavelength as the incident radiation. The Compton effect 
is easily explained by regarding the radiation as particles or 
photons, i.e., particles of quantized energy, rather than as 
waves. The energy of a photon is inversely proportional to 
its wavelength. The collision of a y-ray photon with an 
electron is like a collision between billiard balls; part of the 
photon’s energy is transferred to the electron. The scattered 
photon has lower energy and hence a longer wavelength 
than the incident photon. The Compton effect was an 
important verification of quantum theory. 

The density logger, or gamma-gamma logger (Fig. 
2.41), is a cylindrical device that contains a radioactive 
source of y-rays, such as 137 Cs, which emits radiation 
through a narrow slit. The y-ray photons collide with the 
loosely bound electrons of atoms near the hole, and are 
scattered. A scintillation counter to detect and measure 
the intensity of y-rays is located in the tool about 
45-60 cm above the emitter; the radiation reaching it also 
passes through a slit. Emitter and detector are shielded 
with lead, and the tool is pressed against the wall of the 
borehole by a strong spring, so that the only radiation 
registered is that resulting from the Compton scattering 
in the surrounding formation. The intensity of detected 
radiation is determined by the density of electrons, and so 
by the density of rock near to the logging tool. The y-rays 
penetrate only about 15 cm into the rock. 

Calibrated gamma-gamma logs give the bulk density of 
the rock surrounding a borehole. This information is also 
needed for calculating porosity, which is defined as the 
fractional volume of the rock represented by pore spaces. 
Most sedimentary rocks are porous, the amount depend¬ 
ing on the amount of compaction experienced. Igneous 
and metamorphic rocks generally have low porosity, unless 
they have been fractured. Usually the pores are filled with 
air, gas or a fluid, such as water or oil. If the densities of the 
matrix rock and pore fluid are known, the bulk density 
obtained from gamma-gamma logging allows the porosity 
of the rock to be determined. 

2.5.53 Borehole gravimetry 

Modern instrumentation allows gravity to be measured 
accurately in boreholes. One type of borehole gravimeter is 
a modification of the LaCoste-Romberg instrument, 
adapted for use in the narrow borehole and under condi¬ 
tions of elevated temperature and pressure. Alternative 




82 


Gravity, the figure of the Earth and geodynamics 





Fig. 2.41 (a) The design of a gamma-gamma logging device for 
determining density in a borehole (after Telford eta!., 1990), and (b) a 
schematic gamma-gamma log calibrated in terms of the rock density. 
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Fig. 2.43 Determination of the density of near-surface rocks by 
Nettleton's method, (a) Gravity measurements are made on a profile 
across a small hill, (b) The data are corrected for elevation with various 
test values of the density. The optimum density gives minimum 
correlation between the gravity anomaly (Ag B ) and the topography. 


Let gj and g 2 be the values of gravity measured in a verti¬ 
cal borehole at heights h 1 and h 2 , respectively, above the ref¬ 
erence ellipsoid (Fig. 2.42). The difference between g 1 and 
g 1 is due to the different heights and to the material between 
the two measurement levels in the borehole. The value g, 
will be larger than g t for two reasons. First, because the 
lower measurement level is closer to the Earth's center, g, 
will be greater than g, by the amount of the combined ele¬ 
vation correction, namely (0.3086 — (0.0419pX 10 _3 ))A/z 
mgal, where Ah = h x - h 2 . Second, at the lower level h 2 the 
gravimeter experiences an upward Bouguer attraction due 
to the material between the two measurement levels. This 
reduces the measured gravity at h 2 and requires a compen¬ 
sating increase to g 7 of amount (0.0419pX 10 _3 )A/; mgal. 
The difference Ag between the corrected values of g l and g~, 
after reduction to the level h 2 is then 

Ag = (0.3086 - 0.0419p X 10“ 3 )A/z - 0.0419p X 10“ 3 A/2 
= (0.3086 - 0.0838p X 10“ 3 )A/i (2.76) 

Rearranging this equation gives the density p of the mate¬ 
rial between the measurement levels in the borehole: 

p =(3.683 - 11.93^f) X10 3 kg m- 3 (2.77) 


Fig. 2.42 Geometry for computation of the density of a rock layer from 
gravity measurements made in a vertical borehole. 

instruments have been designed on different principles; they 
have a comparable sensitivity of about 0.01 mgal. Their 
usage for down-hole density determination is based on 
application of the free-air and Bouguer plate corrections. 


If borehole gravity measurements are made with an 
accuracy of ±0.01 mgal at a separation of about 10m, the 
density of the material near the borehole can be determined 
with an accuracy of about ±lOkgm -3 . More than 90% of 
the variation in gravity in the borehole is due to material 
within a radius of about 5Ah from the borehole (about 50 m 
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Fig. 2.44 Free-air and 
Bouguer anomalies across a 
mountain range. In (a) the 
mountain is modelled by a 
fully supported block, and in 
(b) the mass of the mountain 
above sea-level (SL) is 
compensated by a less-dense 
crustal root, which projects 
down into the denser mantle 
(based on Bott, 1982). 


(a) 



200 km 



for a distance A h « 10 m between measurement levels). This 
is much larger than the lateral range penetrated by 
gamma-gamma logging. As a result, effects related to the 
borehole itself are unimportant. 

2.5.54 Nettleton's method for near-surface density 

The near-surface density of the material under a hill can 
be determined by a method devised by L. Nettleton that 
compares the shape of a Bouguer gravity anomaly (see 
Section 2.5.6) with the shape of the topography along a 
profile. The method makes use of the combined elevation 
correction (Ag FA + Ag Bp ) and the terrain correction (Ag x ), 
which are density dependent. The terrain correction is 
less important than the Bouguer plate correction and can 
usually be neglected. 

A profile of closely spaced gravity stations is measured 
across a small hill (Fig. 2.43). The combined elevation 
correction is applied to each measurement. Suppose that 
the true average density of the hill is 2600 kgm -3 . If the 
value assumed for p is too small (say, 2400 kgm -3 ), Ag Bp 
at each station will be too small. The discrepancy is 
proportional to the elevation, so the Bouguer gravity 
anomaly is a positive image of the topography. If the 
value assumed for p is too large (say, 2800 kgm -3 ), the 
opposite situation occurs. Too much is subtracted at each 
point, giving a computed anomaly that is a negative image 
of the topography. The optimum value for the density is 
found when the gravity anomaly has minimum correla¬ 
tion with the topography. 

2.5.6 Free-air and Bouguer gravity anomalies 

Suppose that we can measure gravity on the reference 
ellipsoid. If the distribution of density inside the Earth is 
homogeneous, the measured gravity should agree with the 


theoretical gravity given by the normal gravity formula. 
The gravity corrections described in Section 2.5.4 com¬ 
pensate for the usual situation that the point of measure¬ 
ment is not on the ellipsoid. A discrepancy between the 
corrected, measured gravity and the theoretical gravity is 
called a gravity anomaly. It arises because the density of 
the Earth’s interior is not homogeneous as assumed. The 
most common types of gravity anomaly are the Bouguer 
anomaly and the free-air anomaly. 

The Bouguer gravity anomaly (Ag B ) is defined by 
applying all the corrections described individually in 
Section 2.5.4: 

A £b = 8m + < A £fA “ A £bP + A St + A £t,de) “ Sn ( 2 - 78 ) 

In this formula g m and g n are the measured and normal 
gravity values; the corrections in parentheses are the free- 
air correction (Ag FA ), Bouguer plate correction (Ag Bp ), 
terrain correction (Ag x ) and tidal correction (Ag tide ). 

The free-air anomaly Ag- p is defined by applying only 
the free-air, terrain and tidal corrections to the measured 
gravity: 

A £f = Em + ( A £FA + A £t + A Stide) “ Sn ( 2 - 79 ) 

The Bouguer and free-air anomalies across the same 
structure can look quite different. Consider first the topo¬ 
graphic block (representing a mountain range) shown in 
Fig. 2.44a. For this simple structure we neglect the terrain 
and tidal corrections. The difference between the Bouguer 
anomaly and the free-air anomaly arises from the 
Bouguer plate correction. In computing the Bouguer 
anomaly the simple elevation of the measurement station 
is taken into account together with the free-air correction. 
The measured gravity contains the attraction of the land- 
mass above the ellipsoid, which is compensated with the 
Bouguer plate correction. The underground structure 
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does not vary laterally, so the corrected measurement 
agrees with the theoretical value and the Bouguer 
anomaly is everywhere zero across the mountain range. In 
computing the free-air anomaly only the free-air correc¬ 
tion is applied; the part of the measured gravity due to the 
attraction of the landmass above the ellipsoid is not taken 
into account. Away from the mountain-block the 
Bouguer and free-air anomalies are both equal to zero. 
Over the mountain the mass of the mountain-block 
increases the measured gravity compared to the reference 
value and results in a positive free-air anomaly across the 
mountain range. 

In fact, seismic data show that the Earth’s crust is 
usually much thicker than normal under a mountain 
range. This means that a block of less-dense crustal rock 
projects down into the denser mantle (Fig. 2.44b). After 
making the free-air and Bouguer plate corrections there 
remains a Bouguer anomaly due to a block that repre¬ 
sents the “root-zone” of the mountain range. As this is 
less dense than the adjacent and underlying mantle it con¬ 
stitutes a mass deficit. The attraction on a gravimeter at 
stations on a profile across the mountain range will be less 
than in Fig. 2.44a, so the corrected measurement will be 
less than the reference value. A strongly negative Bouguer 
anomaly is observed along the profile. At some distance 
from the mountain-block the Bouguer and free-air anom¬ 
alies are equal but they are no longer zero, because 
the Bouguer anomaly now contains the effect of the 
root-zone. Over the mountain-block the free-air anomaly 
has a constant positive offset from the Bouguer anomaly, 
as in the previous example. Note that, although the free- 
air anomaly is positive, it falls to a very low value over the 
center of the block. At this point the attraction of the 
mountain is partly cancelled by the missing attraction of 
the less-dense root-zone. 

2.6 INTERPRETATION OF GRAVITY ANOMALIES 

2.6.1 Regional and residual anomalies 

A gravity anomaly results from the inhomogeneous distri¬ 
bution of density in the Earth. Suppose that the density of 
rocks in a subsurface body is p and the density of the rocks 
surrounding the body is p 0 . The difference Ap = p-p 0 is 
called the density contrast of the body with respect to the 
surrounding rocks. If the body has a higher density than 
the host rock, it has a positive density contrast; a body with 
lower density than the host rock has a negative density 
contrast. Over a high-density body the measured gravity is 
augmented; after reduction to the reference ellipsoid and 
subtraction of the normal gravity a positive gravity 
anomaly is obtained. Likewise a negative anomaly results 
over a region of low density. The presence of a gravity 
anomaly indicates a body or structure with anomalous 
density; the sign of the anomaly is the same as that of the 
density contrast and shows whether the density of the 
body is higher or lower than normal. 



Fig. 2.45 Representation of the regional anomaly on a gravity profile by 
visually fitting the large-scale trend with a smooth curve. 

The appearance of a gravity anomaly is affected by the 
dimensions, density contrast and depth of the anomalous 
body. The horizontal extent of an anomaly is often called 
its apparent “wavelength.” The wavelength of an anomaly 
is a measure of the depth of the anomalous mass. Large, 
deep bodies give rise to broad (long-wavelength), low- 
amplitude anomalies, while small, shallow bodies cause 
narrow (short-wavelength), sharp anomalies. 

Usually a map of Bouguer gravity anomalies contains 
superposed anomalies from several sources. The long- 
wavelength anomalies due to deep density contrasts 
are called regional anomalies. They are important for 
understanding the large-scale structure of the Earth’s 
crust under major geographic features, such as mountain 
ranges, oceanic ridges and subduction zones. Short- 
wavelength residual anomalies are due to shallow anom¬ 
alous masses that may be of interest for commercial 
exploitation. Geological knowledge is essential for inter¬ 
preting the residual anomalies. In eroded shield areas, like 
Canada or Scandinavia, anomalies with very short wave¬ 
lengths may be due to near-surface mineralized bodies. In 
sedimentary basins, short- or intermediate-wavelength 
anomalies may arise front structures related to reservoirs 
for petroleum or natural gas. 

2.6.2 Separation of regional and residual anomalies 

The separation of anomalies of regional and local origin 
is an important step in the interpretation of a gravity 
map. The analysis may be based on selected profiles 
across some structure, or it may involve the two- 
dimensional distribution of anomalies in a gravity map. 
Numerous techniques have been applied to the decompo¬ 
sition of a gravity anomaly into its constituent parts. 
They range in sophistication from simple visual inspec¬ 
tion of the anomaly pattern to advanced mathematical 
analysis. A few examples of these methods are described 
below. 





2.6 INTERPRETATION OF GRAVITY ANOMALIES 


85 


Fig. 2.46 Removal of regional 
trend from a gravity map by 
contour smoothing: (a) hand- 
drawn smoothing of contour 
lines on original Bouguer 
gravity map, (b) map of 
regional gravity variation, (c) 
residual gravity anomaly after 
subtracting the regional 
variation from the Bouguer 
gravity map (after Robinson 
and qoruh, 1988). Values are 
in mgal. 


(a) Bouguer map 



(b) regional anomaly 



(c) residual anomaly 




Distance (km) 


Fig. 2.47 Representation of the regional trend by a smooth polynomial 
curve fitted to the observed gravity profile by the method of least squares. 


2.6.2.1 Visual analysis 

The simplest way of representing the regional anomaly on a 
gravity profile is by visually fitting the large-scale trend with 
a smooth curve (Fig. 2.45). The value of the regional gravity 
given by this trend is subtracted point by point from the 
Bouguer gravity anomaly. This method allows the inter¬ 
preter to fit curves that leave residual anomalies with a sign 
appropriate to the interpretation of the density distribution. 

This approach may be adapted to the analysis of a 
gravity map by visually smoothing the contour lines. In 
Fig. 2.46a the contour lines of equal Bouguer gravity curve 
sharply around a local abnormality. The more gently 
curved contours have been continued smoothly as dotted 
lines. They indicate how the interpreter thinks the regional 


gravity field (Fig. 2.46b) would continue in the absence of 
the local abnormality. The values of the regional and origi¬ 
nal Bouguer gravity are interpolated from the correspond¬ 
ing maps at points spaced on a regular grid. The regional 
value is subtracted from the Bouguer anomaly at each 
point and the computed residuals are contoured to give a 
map of the local gravity anomaly (Fig. 2.46c). The experi¬ 
ence and skill of the interpreter are important factors in 
the success of visual methods. 

2.62.2 Polynomial representation 

In an alternative method the regional trend is represented 
by a straight line or, more generally by a smooth polyno¬ 
mial curve. If x denotes the horizontal position on a 
gravity profile, the regional gravity Ag R may be written 

a ?r = A?o + A &i x + a ? 2 x ' 2 + a £ 3 x ' 3 + -- + A g„ x " (2.80) 

The polynomial is fitted by the method of least squares 
to the observed gravity profile. This gives optimum values 
for the coefficients A g n . The method also has drawbacks. 
The higher the order of the polynomial, the better it fits 
the observations (Fig. 2.47). The ludicrous extreme is 
when the order of the polynomial is one less than the 
number of observations; the curve then passes perfectly 
through all the data points, but the regional gravity 
anomaly has no meaning geologically. The interpreter’s 
judgement is important in selecting the order of the poly¬ 
nomial, which is usually chosen to be the lowest possible 
order that represents most of the regional trend. 
Moreover, a curve fitted by least squares must pass 
through the mean of the gravity values, so that the resid¬ 
ual anomalies are divided equally between positive and 
negative values. Each residual anomaly is flanked by 
anomalies of opposite sign (Fig. 2.47), which are due to 
the same anomalous mass that caused the central 
anomaly and so have no significance of their own. 

Polynomial fitting can also be applied to gravity maps. 
It is assumed that the regional anomaly can be repre¬ 
sented by a smooth surface, Ag(.v, v), which is a low order 
polynomial of the horizontal position coordinates x and 
y. In the simplest case the regional anomaly is expressed 
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Box 2.4: Fourier analysis 


If integrated over a full cycle, the resulting value of a 
sine or cosine function is zero. The integrated value of 
the product of a sine function and a cosine function is 
also zero. Mathematically, this defines the sines and 
cosines as orthogonal functions. The squared values of 
sines and cosines do not integrate to zero over a full 
cycle; this property can be used to normalize functions 
that can be expressed in terms of sines and cosines. 
These observations may be summarized as follows for 
the functions sin(«0) and cos(nfl) 

2ir 2 tt 

f sin(/ 20 )d 0 = f cos(? 20 )d 0 = 0 

2 it , 2tt 

f sin 2 («0)d0 ^ £ f 0 — cos(2ft0))d0 = tt (1) 

i) 

2-tt 2tt 

fcos 2 («0)d0 = f (1 + cos(2«0))d0 = tt 

V t> 


Ag(x) = flj cos(kx) + b x sm(kx) + a 2 cos(2kx) 

+ b 2 s\n{2kx) + « 3 cos(3fcv) + 6 3 sin(3£x) + • • • 

N 

Ag(x) = 2 («„cos (nkx) + b n sm{nkx)) (3) 

n— 1 

This expression for Ag(x) is called a Fourier series. In 
Eq. (3) the summation is truncated after N sine and 
cosine terms. The value of N is chosen to be as large as 
necessary to describe the gravity anomaly adequately. 
The importance of any individual term of order n is 
given by the values of the corresponding coefficients a n 
and b n , which act as weighting functions. 

The coefficients a n and b n can be calculated using the 
orthogonal properties of sine and cosine functions sum¬ 
marized in Eqs. (1) and (2). If both sides of Eq. (3) for 
the gravity anomaly Ag(x) are multiplied by cos (mkx), 
we get 


If follows by applying these results and invoking the 
formulas for the sums and differences of sines and 
cosines that 


N 

Ag(x)cos(mkx) = ^ ( a„cos(nkx )cos (mkx ) 

n— 1 

+ b n sin(nkx)cos(mkx)) 


(4) 


sin(«0)cos(m0)d0 = ^ j sin^^-^0 


+ sin i 



jd0 = 0 


ZTT ZTT 

cos(nd)cos(mO)dO = sm(nO)sm(m9)dd 



0, if m ^ n 
tt, if m = n 


( 2 ) 


These relationships form the basis of Fourier analysis. 

Geophysical signals that vary periodically in time (e.g., 
seismic waves) or space (e.g., gravity, magnetic or thermal 
anomalies) can be expressed as the superposition of har¬ 
monics of a fundamental frequency or wave number. For 
example, consider a gravity anomaly Ag(x) along a profile 
of length L in the ^-direction. The fundamental “wave¬ 
length” A of the anomaly is equal to 2 L, i.e., twice the 
profile length. A corresponding fundamental wave 
number is defined for the profile as k = (2tt/A), so that the 
argument 6 in the sine and cosine functions in Eqs. (1) and 
(2) is replaced by (kx). The observed anomaly is repre¬ 
sented by adding together components corresponding to 
harmonics of the fundamental wavelength: 


Each product on the right-hand side of this equation 
can be written as the sum or difference of two sines or 
cosines. Thus, 

1 n / 

Ag(x)cos(mkx) — a„[cos((n + m)kx ) 

n=i V 

+ cos ((« — m)kx)] + feJsin((H — m)kx) 

+ sin ((n — m)kx)]J (5) 

Integration of A g(x) cos(2 mkx) over a full wave¬ 
length of x causes all terms on the right side of Eq. (5) to 
vanish unless n = m, when the term cos ((n—m)kx) = 
cos(0) = 1. This allows us to determine the coefficients a n 
in Eq. (3): 

a i a 

Ag(x)cos(nkx )dx = ^a n dx = ^«„ (6) 

J o J o 

2 A 1 2tt 

a„ — ^\ Agf x)cos( nkx )dx = ^ Ag(x)cos(«0)d0 (7) 
2 o ■’o 

The coefficients b n in Eq. (3) are obtained similarly 
by multiplying Ag(x) by sin (mkx) and integrating over a 
full cycle of the signal. This gives 

r . X , 2tT 

b n = ^ J Ag(x)sin(«Ax)dx = tt J Ag(x)sin(«0)d0 (8) 
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Box 2.5: Double Fourier series 

The two-dimensional variation of a mapped gravity 
anomaly can be analyzed with the aid of double 
Fourier series. In this case the gravity anomaly is a 
function of both the x- and y-coordinates and can be 
written 


N M 

A g(x,y) = 

n—\m=\ 

+ c nm S n <? m + d nm S n S* m ) (1) 

where 


C n = cos^2«tt S n = sin^2mr 
C = cos^2»m ST m = sin^rmr 


( 2 ) 


In these expressions the fundamental wavelengths 
A v and A v express the extent of the anomaly in the x- 
and v-directions, respectively. The derivation of the 
coefficients a„. b. c and is similar in principle 
to the one-dimensional case, relying on the orthogo¬ 
nality of the individual sine and cosine terms, and the 
property that the products of two sine terms, two 
cosine terms or a sine term with a cosine term, can be 
expressed as the sum or difference of other sine or 
cosine functions. As might be expected, the analysis of 
double Fourier series is somewhat more complicated 
than in the one-dimensional case, but it delivers results 
that characterize the two-dimensional variation of the 
regional gravity anomaly. 


as a first-order polynomial, or plane. To express changes 
in the gradient of gravity a higher-order polynomial is 
needed. For example, the regional gravity given by a 
second-order polynomial is written 

Ag(x,y) = A g 0 + A g xl x + A g yl y 

+ A g x2 x 2 + A g y2 y 2 + A g xy xy (2.8 1) 

As in the analysis of a profile, the optimum values of 
the coefficients A g xl , A g yl , etc., are determined by 
least-squares fitting. The residual anomaly is again com¬ 
puted point by point by subtracting the regional from the 
original data. 

2.6.23 Representation by Fourier series 

The gravity anomaly along a profile can be analyzed with 
techniques developed for investigating time series. Instead 
of varying with time, as the seismic signal does in a seis¬ 
mometer, the gravity anomaly Ag(x) varies with position 



Fig. 2.48 Expression of the two-dimensional variation of a gravity 
anomaly using double Fourier series: (a) a single harmonic in the x- 
direction, (b) two harmonics in the x-direction, (c) superposed single 
harmonics in thex- andy-directions, respectively (after Davis, 1973). 


x along the profile. For a spatial distribution the wave 
number, k = IttIX , is the counterpart of the frequency of a 
time series. If it can be assumed that its variation is peri¬ 
odic, the function Ag(x) can be expressed as the sum of a 
series of discrete harmonics. Each harmonic is a sine or 
cosine function whose argument is a multiple of the fun¬ 
damental wave number. The expression for Ag(x) is called 
a Fourier series (Box 2.4). 

The breakdown of a complex anomaly (or time 
series) in terms of simpler periodic variations of different 
wavelengths is called Fourier analysis and is a powerful 
method for resolving the most important components of 
the original signal. 

The two-dimensional variation of a mapped gravity 
anomaly can be expressed in a similar way with the aid of 
double Fourier series (Box 2.5). As in the simpler one¬ 
dimensional case of a gravity anomaly on a profile, the 
expression of two-dimensional gravity anomalies by 
double Fourier series is analogous to summing weighted 
sinusoidal functions. These can be visualized as corruga¬ 
tions of the x-y plane (Fig. 2.48), with each corrugation 
weighted according to its importance to A g(x, y). 

2.6.24 Anomaly enhancement and filtering 

The above discussion shows how a function that is peri¬ 
odic can be expressed as a Fourier sum of harmonics of a 
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Box 2.6: Complex numbers 


In determining the roots of cubic and higher order poly¬ 
nomials it is sometimes necessary to use the square 
roots of negative numbers. A negative number can be 
written as the positive number multiplied by (—1), so 
the square root of a negative number contains the imag¬ 
inary unit i, defined as the square root of (—1). A 
complex number z consists of a real part .v and an imag¬ 
inary part y, and is written 

z = x + iy (1) 

The number z* = x — i y is called the complex conju¬ 
gate of z. The product zz* = x 2 Fy 2 gives the squared 
amplitude of the complex number. 

The great Swiss mathematician Leonhard Euler 
showed in 1748 how the trigonometric functions, cost? 
and sind, are related to a complex exponential function: 

cosfl + i sin0 = e l£l (2) 

where e is the base of the natural logarithms and i is the 
imaginary unit. This formula is fundamental to the 
study of functions of a complex variable, a branch of 
mathematics known as complex analysis that can be used 
to solve a wide range of practical and theoretical prob¬ 
lems. In this expression, cost? is called the real part of e Ifl 
and sin0 is called the imaginary part. It follows that 


Imaginary 

axis 



z = x + iy = r(cos0 + isinfl) = re 10 (4) 


i = e ilT/2 

Vi = e-/ 4 = ^(1 +i) (3) 

Complex numbers can be depicted geometrically on 
an Argand diagram, (also called the complex plane) 
invented in 1806 by another Swiss mathematician, Jean- 
Robert Argand, as an aid to their visualization. In this 
diagram (Fig. B2.6) the real part of the number, x in our 
case, is plotted on the horizontal axis and the imaginary 
part, y, on the vertical axis. If the distance of the point P 
at (x, y) from the origin of the plot at O is r, and the 
angle between OP and the x-axis is 6, then the real part 
of the complex number is x = r cos 6 and the imaginary 
part is y = r sin0, so that, using Euler’s formula 


Complex numbers are combined in the same way as 
real numbers. Two complex numbers, z t and z 2 , combine 
to form a new complex number, z. A special case is when 
the complex number is combined with its complex con¬ 
jugate ; this results in a real number. Some examples of 
possible combinations of complex numbers are as 
follows: 

z = Zj + z 2 = (Xj + x 2 ) + i (y x +y 2 ) 
z — r x e l6 i + r 2 e 10 2 

2 = Z 1 Z 2 = (*1 + iyi)(x 2 + iT 2 ) 

= (* 1*2 + y 1 T 2 ) + i(*iT 2 + >’ 1 -F) (6) 

z = r je i8 uye i8 2 = 


fundamental wavelength. By breaking down the observed 
signal into discrete components, it is possible to remove 
some of these and reconstruct a filtered version of the 
original anomaly. However, the requirements of periodic 
behavior and discreteness of harmonic content are often 
not met. For example, the variation of gravity from one 
point to another is usually not periodic. Moreover, if the 
harmonic content of a function is made up of distinct 
multiples of a fundamental frequency or wave number, the 
wavelength spectrum consists of a number of distinct 
values. Yet many functions of geophysical interest are best 
represented by a continuous spectrum of wavelengths. 


To handle this kind of problem the spatial variation of 
gravity is represented by a Fourier integral, which consists 
of a continuous set of frequencies or wave numbers instead 
of a discrete set. The Fourier integral can be used to repre¬ 
sent non-periodic functions. It uses complex numbers (Box 
2.6), which are numbers that involve i, the square-root of 
-1. If the gravity anomaly is analyzed in two dimensions 
(instead of on a single profile), a two-dimensional integral 
is needed, analogous to the double Fourier series represen¬ 
tation described in Section 2.6.2.3 and Box 2.5. The 
observed gravity can then be manipulated using the tech¬ 
niques of Fourier transforms (Box 2.7). These techniques 
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Box 2.7: Fourier transforms 

Functions that cannot be expressed as the Fourier 
series of individual terms, as in Boxes 2.4 and 2.5, may 
be replaced by a Fourier integral This consists of a 
continuous set of frequencies or wave numbers instead 
of a discrete set. The Fourier integral can be used to 
represent non-periodic functions. The gravity anomaly 
Ag(.v) is now written as an integral instead of as a sum 
of discrete terms: 

Ag(.v) = J G(u)e wx du (1) 

where, by using the properties of complex numbers, it 
can be shown that 

G(u ) = 2 ~ j Ag(.v)e _1 " v d.\' (2) 

The complex function G(u) is the Fourier transform 
of the real-valued function A g(x). An adequate treat¬ 
ment of Fourier transforms is beyond the scope of this 
book. Flowever, the use of this powerful mathematical 
technique can be illustrated without delving deeply 
into the theory. 

A map of gravity anomalies can be represented by 
a function A g(x, y) of the Cartesian map coordinates. 
The Fourier transform of A g(x, y) is a two-dimen¬ 
sional complex function that involves wave numbers 
k x and k y defined by wavelengths of the gravity field 
with respect to the x- and y-axes (7r Y = 2 tt/A v , k y = 
2Tr/A r ). It is 

G(x,y)= j j Ag(x,y){cos(k x x + k y y) 

— i sin(£ r v + k v y) Jd.v dv (3) 

This equation assumes that the observations 
A g(x, y) can be represented by a continuous function 
defined over an infinite x-y plane, whereas in fact the 
data are of finite extent and are known at discrete 
points of a measurement grid. In practice, these 
inconsistencies are usually not important. Efficient 
computer algorithms permit the rapid computation 
of the Fourier transform G(x, y) of the gravity 
anomaly A g(x, y). The transformed signal can be 
readily manipulated in the Fourier domain by convo¬ 
lution with a filter function and the result deconvo¬ 
lved back into the spatial domain. This allows the 
operator to examine the effects of low- and high-pass 
filtering on the observed anomaly, which can greatly 
help in its interpretation. 


involve intensive computations and are ideally suited to 
digital data-processing with powerful computers. 

The two-dimensional Fourier transform of a gravity 
map makes it possible to digitally filter the gravity 
anomalies. A filter is a spatial function of the coordinates 
x and y. When the function A g(x, y) representing the 
gravity data is multiplied by the filter function, a new 
function is produced. The process is called convolution 
and the output is a map of the filtered gravity data. The 
computation in the spatial domain defined by the x- and 
y-coordinates can be time consuming. It is often faster to 
compute the Fourier transforms of the gravity and filter 
functions, multiply these together in the Fourier domain, 
then perform an inverse Fourier transform on the 
product to convert it back to the spatial domain. 

The nature of the filter applied in the Fourier domain 
can be chosen to eliminate certain wavelengths. For 
example, it can be designed to cut out all wavelengths 
shorter than a selected wavelength and to pass longer wave¬ 
lengths. This is called a low-pass filter, it passes long wave¬ 
lengths that have low wave numbers. The irregularities in a 
Bouguer gravity anomaly map (Fig. 2.49a) are removed by 
low-pass filtering, leaving a filtered map (Fig. 2.49b) that is 
much smoother than the original. Alternatively, the filter 
in the Fourier domain can be designed to eliminate 
wavelengths longer than a selected wavelength and to pass 
shorter wavelengths. The application of such a high-pass 
filter enhances the short-wavelength (high wave number) 
component of the gravity map (Fig. 2.49c). 

Wavelength filtering can be used to emphasize selected 
anomalies. For example, in studying large-scale crustal 
structure the gravity anomalies due to local small bodies 
are of less interest than the regional anomalies, which can 
be enhanced by applying a low-pass filter. Conversely, in 
the investigation of anomalies due to shallow crustal 
sources the regional effect can be suppressed by high-pass 
filtering. 

2.6.3 Modelling gravity anomalies 

After removal of regional effects the residual gravity 
anomaly must be interpreted in terms of an anomalous 
density distribution. Modern analyses are based on itera¬ 
tive modelling using high-speed computers. Earlier 
methods of interpretation utilized comparison of the 
observed gravity anomalies with the computed anomalies 
of geometric shapes. The success of this simple approach is 
due to the insensitivity of the shape of a gravity anomaly 
to minor variations in the anomalous density distribution. 
Some fundamental problems of interpreting gravity 
anomalies can be learned from the computed effects of 
geometric models. In particular, it is important to realize 
that the interpretation of gravity anomalies is not unique; 
different density distributions can give the same anomaly. 
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Fig. 2.49 The use of wavelength filtering to emphasize selected 
anomalies in the Sierra Nevada, California: (a) unfiltered Bouguer 
gravity map, (b) low-pass filtered gravity map with long-wavelength 
regional anomalies, and (c) high-pass filtered gravity map enhancing 
short-wavelength local anomalies. Contour interval: (a) and (b) 10 mgal, 
(c) S mgal (after Dobrin and Savit, 1988). 


2.6.3.1 Uniform sphere: model for a diapir 

Diapiric structures introduce material of different 
density into the host rock. A low-density salt dome 
(p = 2150kgm -3 ) intruding higher-density carbonate 
rocks (p 0 = 2500kgm -3 ) has a density contrast 

Ap = — 350 kg m -3 and causes a negative gravity anomaly. 
A volcanic plug (p = 2800 kg m -3 ) intruding a granite 
body (p 0 = 2600kgm -3 ) has a density contrast 

Ap= + 200 kg m -3 , which causes a positive gravity 
anomaly. The contour lines on a map of the anomaly are 
centered on the diapir, so all profiles across the center of the 
structure are equivalent. The anomalous body can be 



Fig. 2.50 Gravity anomalies for buried spheres with the same radius R 
and density contrast Ap but with their centers at different depths z 
below the surface. The anomaly of the deeper sphere B is flatter and 
broader than the anomaly of the shallower sphere A. 


modelled eq ually by a vertical cylinder or by a sphere, which 
wewillevaluateherebecauseof the simplicity of themodel. 

Assume a sphere of radius R and density contrast Ap 
with its center at depth z below the surface (Fig. 2.50). The 
attraction A g of the sphere is as though the anomalous 
mass M of the sphere were concentrated at its center. If we 
measure horizontal position from a point above its center, 
at distance x the vertical component A g_ is given by 

Ag r = Agsin0 = G^f (2.82) 

where 

M = ^Tri? 3 Ap and r 2 = z 2 + x 2 


stituting these expressions into Eq. (2.82) and 
ring terms gives 

z 


rear- 


= -^TiGApR 3 


(z 2 + x 2 ) 3/2 


fApR 3 ) 

1 

\ 22 1 

(1 + (x/z) 2 ) 


3/2 




The terms in the first pair of parentheses depend on 
the size, depth and density contrast of the anomalous 
sphere. They determine the maximum amplitude of the 
anomaly, Ag 0 , which is reached over the center of the 
sphere at * = 0. The peak value is given by 


A So 



A pR 3 \ 

z 2 ) 


(2.84) 
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This equation shows how the depth to the center of the 
sphere affects the peak amplitude of the anomaly; the 
greater the depth to the center, the smaller the amplitude 
(Fig. 2.50). For a given depth the same peak anomaly can 
be produced by numerous combinations of Ap and R: a 
large sphere with a low density contrast can give an iden¬ 
tical anomaly to a small sphere with a high density con¬ 
trast. The gravity data alone do not allow us to resolve 
this ambiguity. 

The terms in the second pair of parentheses in Eq. 
(2.83) describe how the amplitude of the anomaly varies 
with distance along the profile. The anomaly is symmetri¬ 
cal with respect to x, reaches a maximum value Ag 0 over 
the center of the sphere (.v = 0) and decreases to zero at 
great distances (x = =°). Note that the larger the depth z, 
the more slowly the amplitude decreases laterally with 
increasing x. A deep source produces a smaller but 
broader anomaly than the same source at shallower depth. 
The width w of the anomaly where the amplitude has one- 
half its maximum value is called the “half-height width.” 
The depth z to the center of the sphere is deduced from 
this anomaly width from the relationship z = 0.652tv. 


2.6.32 Horizontal line element 

Many geologically interesting structures extend to great 
distances in one direction but have the same cross- 
sectional shape along the strike of the structure. If the 
length along strike were infinite, the two-dimensional vari¬ 
ation of density in the area of cross-section would suffice 
to model the structure. However, this is not really valid as 
the lateral extent is never infinite. As a general rule, if the 
length of the structure normal to the profile is more than 
twenty times its width or depth, it can be treated as two- 
dimensional (2D). Otherwise, the end effects due to the 
limited lateral extent of the structure must be taken into 
account in computing its anomaly. An elongate body that 
requires end corrections is sometimes referred to as a 2.5D 
structure. For example, the mass distribution under elon¬ 
gated bodies like anticlines, synclines and faults should be 
modelled as 2.5D structures. Here, we will handle the 
simpler two-dimensional models of these structures. 

Let an infinitely long linear mass distribution with 
mass m per unit length extend horizontally along the y- 
axis at depth z (Fig. 2.51). The contribution d(Ag_) to the 
vertical gravity anomaly A g v at a point on the .v-axis due 
to a small element of length dy is 


d,i 


(2.85) 


The line element extends to infinity along the positive 
and negative v-axis, so its vertical gravity anomaly is 
found by integration: 

oo oo 

f dv f dv 

= Gm -j ,.3 = Gm -j {u l + 2)3/2 (2.86) 

— 00 —00 



Fig. 2.51 Geometry for calculation of the gravity anomaly of an 
infinitely long linear mass distribution with mass m per unit length 
extending horizontally along they-axis at depth z. 


where u 2 = x 2 + z 2 . The integration is simplified by chang¬ 
ing variables, so that y = u tan <p; then dv = u sec 2 <p dtp and 
{u 2 +y 2 ) m = iP sec 3 <p. This gives 



(2.87) 


which, after evaluation of the integral, gives 


a £.- 


IGtni 
z 2 + x 2 


( 2 . 88 ) 


This expression can be written as the derivative of a 
potential function M': 

^ =Gm %= (2-89) 

T' = Gm log e (’,) = Gm log e (^=^) (2 ' 90) 

is called the logarithmic potential. Equations (2.88) and 
(2.90) are useful results for deriving formulas for the 
gravity anomaly of linear structures like an anticline (or 
syncline) or a fault. 


2.63.3 Horizontal cylinder: model for anticline or syncline 

The gravity anomaly of an anticline can be modelled by 
assuming that the upward folding of strata brings rocks 
with higher density nearer to the surface (Fig. 2.52a), 
thereby causing a positive density contrast. A syncline is 
modelled by assuming that its core is filled with strata of 
lower density that cause a negative density contrast. In 
each case the geometric model of the structure is an infi¬ 
nite horizontal cylinder (Fig. 2.52b). 

A horizontal cylinder may be regarded as composed of 
numerous line elements parallel to its axis. The cross- 
sectional area of an element (Fig. 2.53) gives a mass 
anomaly per unit length, m = Apr dO dr. The contribution 
dT' of a line element to the potential at the surface is 


d^F = 2GAplog e (^)rdrd0 


(2.91) 
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Fig. 2.52 Calculation of the 
gravity anomaly of an 
anticline: (a) structural cross- 
section, and (b) geometric 
model by an infinite 
horizontal cylinder. 
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Fig. 2.53 Cross-sectional geometry for calculating the gravity anomaly of 
a buried horizontal cylinder made up of line elements parallel to its axis. 


Integrating over the cross-section of the cylinder gives 
its potential (F; the vertical gravity anomaly of the cylin¬ 
der is then found by differentiating 'F with respect to z. 
Noting that duldz = zlu we get 



(2.92) 


After first carrying out the differentiation within the inte¬ 
gral, this simplifies to 


A S- 


2GApz 2 7 R r 2 r nGR 2 Apz 


(2.93) 


Comparing Eq. (2.93) and Eq. (2.88) it is evident that 
the gravity anomaly of the cylinder is the same as that of 
a linear mass element concentrated along its axis, with 
mass m = ttR 2 Ap per unit length along the strike of the 
structure. The anomaly can be written 


A g, = 2t tG 


(A P R 2 \ 

1 

\ z / 

1 + (x/z) 2 


(2.94) 


The shape of the anomaly on a profile normal to the 
structure (Fig. 2.52b) resembles that of a sphere (Fig. 
2.50). The central peak value A g Q is given by 


Ag 0 = 2ttG\ 


ApR 2 ^ 


(2.95) 


The anomaly of a horizontal cylinder decreases laterally less 
rapidly than that of a sphere, due to the long extent of the 
cylinder normal to the profile. The “half-height width” w of 
the anomaly is again dependent on the depth z to the axis of 
the cylinder; in this case the depth is given by z = 0.5w\ 


2.63.4 Horizontal thin sheet 


We next compute the anomaly of a thin horizontal ribbon 
of infinite length normal to the plane of the profile. This is 
done by Actively replacing the ribbon with numerous infi¬ 
nitely long line elements laid side by side. Let the depth of 
the sheet be z, its thickness t and its density contrast Ap 
(Fig. 2.54a); the mass per unit length in the v-direction of 
a line element of width dx is (Aptdx). Substituting in Eq. 
(2.88) gives the gravity anomaly of the line element; the 
anomaly of the thin ribbon is then computed by integrat¬ 
ing between the limits x { and x 2 (Fig. 2.54b) 

Ag=2GAptz\^l zl 

x \ 


= 2GApt\ 


tan 1 




(2.96) 


Writing tan l (x l lz) = (J) l and tan 1 (x 2 lz) = r/> 2 as in Fig. 
2.54b the equation becomes 


Ag_ = 2GApt[<t> 2 -<t> l ] 


(2.97) 


i.e., the gravity anomaly of the horizontal ribbon is pro¬ 
portional to the angle it subtends at the point of measure¬ 
ment. 

The anomaly of a semi-infinite horizontal sheet is a 
limiting case of this result. For easier reference, the origin 
of ,v is moved to the edge of the sheet, so that distances to 
the left are negative and those to the right are positive 
(Fig. 2.54c). This makes 4’ 1 = — tan _1 (x/z). The remote 
end of the sheet is at infinity, and <)>,, = tt/ 2. The gravity 
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(a) 


1 = 0 x 



Fig. 2.54 Geometry for computation of the gravity anomaly across a 
horizontal thin sheet: (a) subdivision of the ribbon into line elements of 
width dx, (b) thin ribbon between the horizontal limits x, and x 2 , and (c) 
semi-infinite horizontal thin sheet. 
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anomaly is then 

Ag z = 2GApt f+ tan"i(f) 


Fig. 2.55 (a) The gravity anomaly across a vertical fault; (b) structure of 
a fault with vertical displacement h, and (c) model of the anomalous 
body as a semi-infinite horizontal slab of heiqht h. 

(2.98) 


A further example is the infinite horizontal sheet, 
which extends to infinity in the positive and negative x and 
y directions. With 0, = tt/2 and <fr l = —tt/2 the anomaly is 

A g_ = I’uGApt (2.99) 

which is the same as the expression for the Bouguer plate 
correction (Eq. (2.73)). 

2.63.5 Horizontal slab: model for a vertical fault 

The gravity anomaly across a vertical fault increases pro¬ 
gressively to a maximum value over the uplifted side (Fig. 
2.55a). This is interpreted as due to the upward displace¬ 
ment of denser material, which causes a horizontal 
density contrast across a vertical step of height h (Fig. 
2.55b). The faulted block can be modelled as a semi-infi¬ 
nite horizontal slab of height h and density contrast Ap 
with its mid-point at depth z 0 (Fig. 2.55c). 

Let the slab be divided into thin, semi-infinite horizon¬ 
tal sheets of thickness dz at depth z. The gravity anomaly 
of a given sheet is given by Eq. (2.98) with dz for the 
thickness t. The anomaly of the semi-infinite slab is found 


by integrating with respect to z over the thickness of the 
slab; the limits of integration are z —(/i/2) and z + (h!2). 
After slightly rearranging terms this gives 

Ag. = 2GAph f+ M tan" 1 If dz (2.100) 

Zq — h/2 ' ' 

The second expression in the brackets is the mean value 
of the angle tan _1 (x/z) averaged over the height of the fault 
step. This can be replaced to a good approximation by the 
value at the mid-point of the step, at depth z Q . This gives 

Ag=2GA P h f + tan- 1 (|j (2.101) 

Comparison of this expression with Eq. (2.98) shows 
that the anomaly of the vertical fault (or a semi-infinite 
thick horizontal slab) is the same as if the anomalous slab 
were replaced by a thin sheet of thickness h at the mid¬ 
point of the vertical step. Equation (2.101) is called the 
"thin-sheet approximation." It is accurate to about 2% 
provided that z 0 > 2h. 
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2.63.6 Iterative modelling 

The simple geometric models used to compute the gravity 
anomalies in the previous sections are crude represent¬ 
ations of the real anomalous bodies. Modern computer 
algorithms have radically changed modelling methods by 
facilitating the use of an iterative procedure. A starting 
model with an assumed geometry and density contrast is 
postulated for the anomalous body. The gravity anomaly 
of the body is then computed and compared with the 
residual anomaly. The parameters of the model are 
changed slightly and the computation is repeated until the 
discrepancies between the model anomaly and the resid¬ 
ual anomaly are smaller than a determined value. 
However, as in the case of the simple models, this does 
not give a unique solution for the density distribution. 

Two- and three-dimensional iterative techniques are in 
widespread use. The two-dimensional (2D) method 
assumes that the anomalous body is infinitely long paral¬ 
lel to the strike of the structure, but end corrections for 
the possibly limited horizontal extent of the body can be 
made. We can imagine that the cross-sectional shape of 
the body is replaced by countless thin rods or line ele¬ 
ments aligned parallel to the strike. Each rod makes a 
contribution to the vertical component of gravity at the 
origin (Fig. 2.56a). The gravity anomaly of the structure 
is calculated by adding up the contributions of all the line 
elements; mathematically, this is an integration over the 
end surface of the body. Although the theory is beyond 
the scope of this chapter, the gravity anomaly has the fol¬ 
lowing simple form: 

Ag_ = IGAp (f : dO (2.102) 

The angle 9 is defined to lie between the positive .v-axis 
and the radius from the origin to a line element (Fig. 
2.56a), and the integration over the end-surface has been 
changed to an integration around its boundary. The com¬ 
puter algorithm for the calculation of this integral is 
greatly speeded up by replacing the true cross-sectional 
shape with an A-sided polygon (Fig. 2.56a). Apart from 
the assumed density contrast, the only important para¬ 
meters for the computation are the (x, z) coordinates of 
the corners of the polygon. The origin is now moved to 
the next point on a profile across the structure. This move 
changes only the x-coordinates of the corners of the 
polygon. The calculations are repeated for each successive 
point on the profile. Finally the calculated gravity 
anomaly profile across the structure is compared to the 
observed anomaly and the residual differences are evalu¬ 
ated. The coordinates of the corners of the polygon are 
adjusted and the calculation is reiterated until the residu¬ 
als are less than a selected tolerance level. 

The gravity anomaly of a three-dimensional (3D) 
body is modelled in a similar way. Suppose that we have a 
contour map of the body; the contour lines show the 
smooth outline of the body at different depths. We could 
construct a close replica of the body by replacing the 



Fig. 2.56 Methods of computing gravity anomalies of irregular bodies: 
(a) the cross-section of a two-dimensional structure can be replaced with 
a multi-sided polygon (Talwani etal., 1959); (b) a three-dimensional 
body can be replaced with thin horizontal laminae (Talwani and Ewing, 
1960). 

material between successive contour lines with thin 
laminae. Each lamina has the same outline as the 
contour line and has a thickness equal to the contour 
separation. As a further approximation the smooth 
outline of each lamina is replaced by a multi-sided 
polygon (Fig. 2.56b). The gravity anomaly of the 
polygon at the origin is computed as in the 2D case, using 
the (x, y) coordinates of the corners, the thickness of the 
lamina and an assumed density contrast. The gravity 
anomaly of the 3D body at the origin is found by adding 
up the contributions of all the laminae. As in the simpler 
2D example, the origin is now displaced to a new point 
and the computation is repeated. The calculated and 
observed anomalies are now compared and the coordi¬ 
nates of the corners of the laminae are adjusted accord¬ 
ingly; the assumed density distribution can also be 
adjusted. The iterative procedure is repeated until the 
desired match between computed and observed anom¬ 
alies is obtained. 
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2.6.4 Some important regional gravity anomalies 

Without auxiliary information the interpretation of gravity 
anomalies is ambiguous, because the same anomaly can be 
produced by different bodies. An independent data source 
is needed to restrict the choices of density contrast, size, 
shape and depth in the many possible gravity models. The 
additional information may be in the form of surface geo¬ 
logical observations, from which the continuation of struc¬ 
tures at depth is interpreted. Seismic refraction or reflection 
data provide better constraints. 

The combination of seismic refraction experiments 
with precise gravity measurements has a long, successful 
history in the development of models of crustal structure. 
Refraction seismic profiles parallel to the trend of elon¬ 
gate geological structures give reliable information about 
the vertical velocity distribution. However, refraction 
profiles normal to the structural trend give uncertain 
information about the tilt of layers or lateral velocity 
changes. Lateral changes of crustal structure can be inter¬ 
preted from several refraction profiles more or less paral¬ 
lel to the structural trend or from seismic reflection data. 
The refraction results give layer velocities and the depths 
to refracting interfaces. To compute the gravity effect of 
a structure the velocity distribution must first be con¬ 
verted to a density model using a P-wave velocity-density 
relationship like the curve shown in Fig. 2.40. The theo¬ 
retical gravity anomaly over the structure is computed 
using a 2D or 3D method. Comparison with the observed 
gravity anomaly (e.g., by calculating the residual 
differences point by point) indicates the plausibility of the 
model. It is always important to keep in mind that, 
because of the non-uniqueness of gravity modelling, a 
plausible structure is not necessarily the true structure. 
Despite the ambiguities, some characteristic features of 
gravity anomalies have been established for important 
regions of the Earth. 

2.6.4.1 Continental and oceanic gravity anomalies 

In examining the shape of the Earth we saw that the ideal 
reference figure is a spheroid, or ellipsoid of rotation. It is 
assumed that the reference Earth is in hydrostatic equilib¬ 
rium. This is supported by observations of free-air and iso¬ 
static anomalies which suggest that, except in some unusual 
locations such as deep-sea trenches and island arcs in sub- 
duction zones, the continents and oceans are in approxi¬ 
mate isostatic equilibrium with each other. By applying the 
concepts of isostasy (Section 2.7) we can understand the 
large-scale differences between Bouguer gravity anomalies 
over the continents and those over the oceans. In general, 
Bouguer anomalies over the continents are negative, espe¬ 
cially over mountain ranges where the crust is unusually 
thick; in contrast, strongly positive Bouguer anomalies are 
found over oceanic regions where the crust is very thin. 

The inverse relationship between Bouguer anomaly 
amplitude and crustal thickness can be explained with the 


Bouguer anomaly 



Fig. 2.57 Hypothetical Bouguer anomalies over continental and 
oceanic areas. The regional Bouguer anomaly varies roughly inversely 
with crustal thickness and topographic elevation (after Robinson and 
goruh, 1988). 

aid of a hypothetical example (Fig. 2.57). Continental 
crust that has not been thickened or thinned by tectonic 
processes is considered to be “normal” crust. It is typically 
30-35 km thick. Under location A on an undeformed con¬ 
tinental coastal region a thickness of 34 km is assumed. 
The theoretical gravity used in computing a gravity 
anomaly is defined on the reference ellipsoid, the surface 
of which corresponds to mean sea-level. Thus, at coastal 
location A on normally thick continental crust the 
Bouguer anomaly is close to zero. Isostatic compensation 
of the mountain range gives it a root-zone that increases 
the crustal thickness at location B. Seismic evidence shows 
that continental crustal density increases with depth from 
about 2700 kgm -3 in the upper granitic crust to about 
2900 kgm -3 in the lower gabbroic crust. Thus, the density 
in the root-zone is much lower than the typical mantle 
density of 3300-3400 kg m -3 at the same depth under A. 
The low-density root beneath B causes a negative Bouguer 
anomaly, which typically reaches —150 to —200mgal. 

At oceanic location C the vertical crustal structure is 
very different. Two effects contribute to the Bouguer 
anomaly. A 5 km thick layer of sea-water (density 
1030kgm -3 ) overlies the thin basic oceanic crust (density 
2900 kgm -3 ) which has an average thickness of only 
about 6 km. To compute the Bouguer anomaly the sea¬ 
water must be replaced by oceanic crustal rock. The 
attraction of the water layer is inherent in the measured 
gravity so the density used in correcting for the Bouguer 
plate and the topography of the ocean bottom is the 
reduced density of the oceanic crust (i.e., 2900— 1030 = 
1870kgm -3 ). However, a more important effect is that 
the top of the mantle is at a depth of only 11 km. In a ver¬ 
tical section below this depth the mantle has a density of 
3300-3400kgm -3 , much higher than the density of the 
continental crust at equivalent depths below site A. 
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Fig. 2.58 Bouguer gravity 
map of Switzerland (after 
Klingeleand Olivier, 1980). 



The lower 23 km of the section beneath C represents a 
large excess of mass. This gives rise to a strong positive 
Bouguer anomaly, which can amount to 300M00 mgal. 

2.6.4.2 Gravity anomalies across mountain chains 

The typical gravity anomaly across a mountain chain is 
strongly negative due to the large low-density root-zone. 
The Swiss Alps provide a good example of the interpreta¬ 
tion of such a gravity anomaly with the aid of seismic 
refraction and reflection results. A precise gravity survey 
of Switzerland carried out in the 1970s yielded an accu¬ 
rate Bouguer gravity map (Fig. 2.58). The map contains 
effects specific to the Alps. Most obviously, the contour 
lines are parallel to the trend of the mountain range. In 
the south a strong positive anomaly overrides the negative 
anomaly. This is the northern extension of the positive 
anomaly of the so-called Ivrea body, which is a high- 
density wedge of mantle material that was forced into an 
uplifted position within the western Alpine crust during 
an earlier continental collision. In addition, the Swiss 
gravity map contains the effects of low-density sediments 
that fill the Molasse basin north of the Alps, the Po plain 
to the south and the major Alpine valleys. 

In the late 1980s a coordinated geological and geophysi¬ 
cal study - the European Geotraverse (EGT) - was made 
along and adjacent to a narrow path stretching from north¬ 
ern Scandinavia to northern Africa. Detailed reflection 
seismic profiles along its transect through the Central Swiss 
Alps complemented a large amount of new and extant 
refraction data. The seismic results gave the depths to 
important interfaces. Assuming a velocity-density rela¬ 
tionship, a model of the density distribution in the lithos¬ 
phere under the traverse was obtained. Making appropriate 
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Fig. 2.59 Lithosphere density model for the Central Swiss Alps 
along the European Geotraverse transect, compiled from seismic 
refraction and reflection profiles. The 2.5D gravity anomaly 
calculated for this lithospheric structure is compared to the 
observed Bouguer anomaly after removal of the effects of the 
high-density Ivrea body and the low-density sediments in the 
Molasse basin, Po plain and larger Alpine valleys (after Holliger 
and Kissling, 1992). 
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Fig. 2.60 Bouguer and free- 
air gravity anomalies over the 
Mid-Atlantic Ridge near 32°N 
The seismic section is 
projected onto the gravity 
profile. The gravity anomaly 
computed from the density 
model fits the observed 
anomaly well, but is non¬ 
unique (after Talwani etal., 
1965). 
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corrections for end effects due to limited extent along strike, 
a 2.5D gravity anomaly was calculated for this lithospheric 
structure (Fig. 2.59). After removal of the effects of the 
high-density Ivrea body and the low-density sediments, the 
corrected Bouguer gravity profile is reproduced well by the 
anomaly of the lithospheric model. Using the geometric 
constraints provided by the seismic data, the lithospheric 
gravity model favors a subduction zone, dipping gently to 
the south, with a high-density wedge of deformed lower 
crustal rock ('‘melange”) in the middle crust. As already 
noted, the fact that a density model delivers a gravity 
anomaly that agrees well with observation does not estab¬ 
lish the reality of the interpreted structure, which can only 
be confirmed by further seismic imaging. However, the 
gravity model provides an important check on the reason¬ 
ableness of suggested models. A model of crustal or lithos¬ 
pheric structure that does not give an appropriate gravity 
anomaly can reasonably be excluded. 


2.6.43 Gravity anomalies across an oceanic ridge 

An oceanic ridge system is a gigantic submarine moun¬ 
tain range. The difference in depth between the ridge crest 
and adjacent ocean basin is about 3 km. The ridge system 
extends laterally for several hundred kilometers on either 
side of the axis. Gravity and seismic surveys have been 
carried out across several oceanic ridge systems. Some 
common characteristics of continuous gravity profiles 
across a ridge are evident on a WNW-ESE transect cross¬ 
ing the Mid-Atlantic Ridge at 32°N (Fig. 2.60). The free- 
air gravity anomalies are small, around 50 mgal or less, 
and correlate closely with the variations in ocean-bottom 
topography. This indicates that the ridge and its flanks are 
nearly compensated isostatically. As expected for an 
oceanic profile, the Bouguer anomaly is strongly positive. 
It is greater than 350 mgal at distances beyond 1000 km 
from the ridge, but decreases to less than 200 mgal over 
the axis of the ridge. 
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Fig. 2.61 Free-air gravity 
anomaly across the Mid- 
Atlantic Ridge near 46°N, and 
the lithospheric density model 
for the computed anomaly 
(after Keen and Tramontini 
1970). 




The depths to important refracting interfaces and the 
P-wave layer velocities are known from seismic refraction 
profiles parallel to the ridge. The seismic structure away 
from the ridge is layered, with P-wave velocities of 
4-5 km s -1 for the basalts and gabbros in oceanic Layer 2 
and 6.5-6.8 km s -1 for the meta-basalts and meta-gabbros 
in Layer 3; the Moho is at about 11km depth, below 
which typical upper-mantle velocities of 8-8.4 km s -1 are 
found. However, the layered structure breaks down under 
the ridge at distances less than 400 km from its axial zone. 
Unusual velocities around 7.3 km s _I occurred at several 
places, suggesting the presence of anomalous low-density 
mantle material at comparatively shallow depth beneath 
the ridge. The seismic structure was converted to a density 
model using a velocity- density relationship. Assuming a 
2D structure, several density models were found that 
closely reproduced the Bouguer anomaly. However, to 
satisfy the Bouguer anomaly on the ridge flanks each 
model requires a flat body in the upper mantle beneath the 
ridge; it extends down to about 30 km depth and 
for nearly 1000 km on each side of the axis (Fig. 2.60). 
The density of the anomalous structure is only 
3150kgm~ 3 instead of the usual 3400 kgm" 3 . The model 
was proposed before the theory of plate tectonics was 


accepted. The anomalous upper mantle structure satisfies 
the gravity anomaly but has no relation to the known 
physical structure of a constructive plate margin. A broad 
zone of low upper-mantle seismic velocities was not 
found in later experiments, but narrow low-velocity zones 
are sometimes present close to the ridge axis. 

A further combined seismic and gravity study of the 
Mid-Atlantic Ridge near 46°N gave a contradictory 
density model. Seismic refraction results yielded P-wave 
velocities of 4.6 km s -1 and 6.6 km s -1 for Layer 2 and 
Layer 3, respectively, but did not show anomalous mantle 
velocities beneath the ridge except under the median 
valley. A simpler density model was postulated to account 
for the gravity anomaly (Fig. 2.61). The small-scale free- 
air anomalies are accounted for by the variations in ridge 
topography seen in the bathymetric plot. The large-scale 
free-air gravity anomaly is reproduced well by a wedge- 
shaped structure that extends to 200 km depth. Its base 
extends to hundreds of kilometers on each side of the 
axis. A very small density contrast of only —40kgm -3 
suffices to explain the broad free-air anomaly. The model 
is compatible with the thermal structure of an accreting 
plate margin. The low-density zone may be associated 
with hot material from the asthenosphere, which rises 
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Fig. 2.62 Observed and 
computed free-air gravity 
anomalies across a 
subduction zone. The density 
model for the computed 
anomaly is based on seismic, 
thermal and petrological data. 
The profile crosses the Chile 
trench and Andes mountains 
at 23°S (after Grow and 
Bowin, 1975). 




beneath the ridge, melts and accumulates in a shallow 
magma chamber within the oceanic crust. 

2.6.44 Gravity anomalies at subduction zones 

Subduction zones are found primarily at continental 
margins and island arcs. Elongate, narrow and intense 
isostatic and free-air gravity anomalies have long been 
associated with island arcs. The relationship of gravity to 
the structure of a subduction zone is illustrated by the 
free-air anomaly across the Chile trench at 23°S (Fig. 
2.62). Seismic refraction data define the thicknesses of the 
oceanic and continental crust. Thermal and petrological 
data are integrated to give a density model for the struc¬ 
ture of the mantle and the subducting lithosphere. 

The continental crust is about 65 km thick beneath the 
Andes mountains, and gives large negative Bouguer 
anomalies. The free-air gravity anomaly over the Andes is 
positive, averaging about +50 mgal over the 4 km high 
plateau. Even stronger anomalies up to+ 100 mgal are 
seen over the east and west boundaries of the Andes. This 
is largely due to the edge effect of the low-density Andean 
crustal block (see Fig. 2.44b and Section 2.5.6). 

A strong positive free-air anomaly of about+70mgal 
lies between the Andes and the shore-line of the Pacific 
ocean. This anomaly is due to the subduction of the Nazca 
plate beneath South America. The descending slab is old 
and cool. Subduction exposes it to higher temperatures 
and pressures, but the slab descends faster than it can be 
heated up. The increase in density accompanying greater 
depth and pressure outweighs the decrease in density due 
to hotter temperatures. There is a positive density contrast 
between the subducting lithosphere and the surrounding 
mantle. Also, petrological changes accompanying the sub¬ 


duction result in mass excesses. Peridotite in the upper 
lithosphere changes phase from plagioclase-type to the 
higher-density garnet-type. When oceanic crust is sub¬ 
ducted to depths of 30-80 km, basalt changes phase to 
eclogite, which has a higher density (3560-3580kgm -3 ) 
than upper mantle rocks. These effects combine to produce 
the positive free-air anomaly. 

The Chile trench is more than 2.5 km deeper than the 
ocean basin to the west. The sediments flooring the trench 
have low density. The mass deficiency of the water and 
sediments in the trench cause a strong negative free-air 
anomaly, which parallels the trench and has an amplitude 
greater than -250 mgal. A small positive anomaly of 
about+ 20 mgal is present about 100 km seaward of the 
trench axis. This anomaly is evident also in the mean level 
of the ocean surface as mapped by SEASAT (Fig. 2.28), 
which shows that the mean sea surface is raised in front of 
deep ocean trenches. This is due to upward flexure of the 
lithosphere before its downward plunge into the subduc¬ 
tion zone. The flexure elevates higher-density mantle rocks 
and thereby causes the small positive free-air anomaly. 

2.7 ISOSTASY 

2.7.1 The discovery of isostasy 

Newton formulated the law of universal gravitation in 1687 
and confirmed it with Kepler’s laws of planetary motion. 
However, in the seventeenth and eighteenth centuries the 
law could not be used to calculate the mass or mean density 
of the Earth, because the value of the gravitational con¬ 
stant was not yet known (it was first determined by 
Cavendish in 1798). Meanwhile, eighteenth century scien¬ 
tists attempted to estimate the mean density of the Earth 













100 Gravity, the figure of the Earth and geodynamics 


by various means. They involved comparing the attraction 
of the Earth with that of a suitable mountain, which could 
be calculated. Inconsistent results were obtained. 

During the French expedition to Peru in 1737-1740, 
Pierre Bouguer measured gravity with a pendulum at 
different altitudes, applying the elevation correction term 
which now bears his name. If the density of crustal rocks 
is p and the mean density of the Earth is p 0 , the ratio of the 
Bouguer-plate correction (see Section 2.5.4.3) for eleva¬ 
tion h to mean gravity for a spherical Earth of radius R is 

Ag BP = 2 tt Gpli _ 3 /pV h\ (2 103) 

* I'nGp.R 2 \Po)\r) 


From the results he obtained near Quito, Bouguer esti¬ 
mated that the mean density of the Earth was about 4.5 
times the density of crustal rocks. 

The main method employed by Bouguer to determine 
the Earth’s mean density consisted of measuring the 
deflection of the plumb-line (vertical direction) by the 
mass of a nearby mountain (Fig. 2.63). Suppose the ele¬ 
vation of a known star is measured relative to the local 
vertical direction at points N and S on the same meridian. 
The elevations should be a N and a s , respectively. Their 
sum is a, the angle subtended at the center of the Earth by 
the radii to N and S, which corresponds to the difference 
in latitude. If N and S lie on opposite sides of a large 
mountain, the plumb-line at each station is deflected by 
the attraction of the mountain. The measured elevations 
of the star are j3 n and /3 S , respectively, and their sum is /3. 
The local vertical directions now intersect at the point D 
instead of at the center of the (assumed spherical) Earth. 
The difference 8= (3 —a is the sum of the deviations of 
the vertical direction caused by the mass of the mountain. 

The horizontal attraction / of the mountain can be 
calculated from its shape and density, with a method that 
resembles the computation of the topographic correction 
in the reduction of gravity measurements. Dividing the 
mountain into vertical cylindrical elements, the horizon¬ 
tal attraction of each element is calculated and its compo¬ 
nent (Gp/Q towards the center of mass of the mountain is 
found. Summing up the effects of all the cylindrical ele¬ 
ments in the mountain gives the horizontal attraction / 
towards its center of mass. Comparing / with mean 
gravity g, we then write 


/ G PlL h i H h , 

tan 8 = t,= 


g 


jttGpqR jttR 


(£) 


(2.104) 


For very small angles, tanS is equal to 8, and so the 
deflection of the vertical is proportional to the ratio p/p 0 
of the mean densities of the mountain and the Earth. 
Bouguer measured the deflection of the vertical caused 
by Mt. Chimborazo (6272 m), Ecuador’s highest moun¬ 
tain. His results gave a ratio p/p 0 of around 12, which is 
unrealistically large and quite different from the values he 
had obtained near Quito. The erroneous result indicated 
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Fig. 2.63 Deviations of the local plumb-line at N and S on opposite 
sides of a large mountain cause the local vertical directions to intersect 
at the point D instead of at the center of the Earth. 


that the deflection of the vertical caused by the mountain 
was much too small for its estimated mass. 

In 1774 Bouguer’s Chimborazo experiment was 
repeated in Scotland by Neville Maskelyne on behalf of 
the Royal Society of London. Measurements of the eleva¬ 
tions of stars were made on the north and south flanks of 
Mt. Schiehallion at sites that differed in latitude by 42.9" 
of arc. The observed angle between the plumb-lines was 
54.6". The analysis gave a ratio p/p 0 equal to 1.79, suggest¬ 
ing a mean density for the Earth of 4500 kgm" 3 . This was 
more realistic than Bouguer’s result, which still needed 
explanation. 

Further results accumulated in the first half of the 
nineteenth century. From 1806 to 1843 the English geode¬ 
sist George Everest carried out triangulation surveys in 
India. He measured by triangulation the separation of a 
site at Kalianpur on the Indo-Ganges plain from a site at 
Kaliana in the foothills of the Himalayas. The distance 
differed substantially from the separation of the sites 
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computed from the elevations of stars, as in Fig. 2.63. The 
discrepancy of 5.23" of arc (162 m) was attributed to 
deflection of the plumb-line by the mass of the 
Flimalayas. This would affect the astronomic determina¬ 
tion but not the triangulation measurement. In 1855 J. H. 
Pratt computed the minimum deflection of the plumb- 
line that might be caused by the mass of the Himalayas 
and found that it should be 15.89" of arc, about three 
times larger than the observed deflection. Evidently the 
attraction of the mountain range on the plumb-line was 
not as large as it should be. 

The anomalous deflections of the vertical were first 
understood in the middle of the nineteenth century, when 
it was realized that there are regions beneath mountains - 
‘Toot-zones” - in which rocks have a lower density than 
expected. The deflection of a plumb-line is not caused 
only by the horizontal attraction of the visible part of a 
mountain. The deficiency of mass at depth beneath the 
mountain means that the “hidden part” exerts a reduced 
lateral attraction, which partly offsets the effect of the 
mountain and diminishes the deflection of the vertical. In 
1889 C. E. Dutton referred to the compensation of a 
topographic load by a less-dense subsurface structure as 
isostasy. 

2.7.2 Models of isostasy 

Separate explanations of the anomalous plumb-line 
deflections were put forward by G. B. Airy in 1855 and 
J. H. Pratt in 1859. Airy was the Astronomer Royal and 
director of the Greenwich Observatory. Pratt was an 
archdeacon of the Anglican church at Calcutta, India, 
and a devoted scientist. Their hypotheses have in com¬ 
mon the compensation of the extra mass of a mountain 
above sea-level by a less-dense region (or root) below 
sea-level, but they differ in the way the compensation is 
achieved. In the Airy model, when isostatic compensa¬ 
tion is complete, the mass deficiency of the root equals 
the excess load on the surface. At and below a certain 
compensation depth the pressure exerted by all overlying 
vertical columns of crustal material is then equal. The 
pressure is then hydrostatic, as if the interior acted like a 
fluid. Hence, isostatic compensation is equivalent to 
applying Archimedes’ principle to the uppermost layers 
of the Earth. 

The Pratt and Airy models achieve compensation 
locally by equalization of the pressure below vertical 
columns under a topographic load. The models were very 
successful and became widely used by geodesists, who 
developed them further. In 1909-1910, J. F. Hayford in the 
United States derived a mathematical model to describe 
the Pratt hypothesis. As a result, this theory of isostasy is 
often called the Pratt-Hayford scheme of compensation. 
Between 1924 and 1938 W. A. Heiskanen derived sets of 
tables for calculating isostatic corrections based on the 
Airy model. This concept of isostatic compensation has 
since been referred to as the Airy-Heiskanen scheme. 
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Fig. 2.64 Local isostatic compensation according to (a) the 
Airy-Heiskanen model and (b) the Pratt-Hayford model; (c) regional 
compensation according to the elastic plate model of Vening Meinesz. 

It became apparent that both models had serious defi¬ 
ciencies in situations that required compensation over a 
larger region. In 1931 F. A. Vening Meinesz, a Dutch geo¬ 
physicist, proposed a third model, in which the crust acts 
as an elastic plate. As in the other models, the crust floats 
buoyantly on a substratum, but its inherent rigidity 
spreads topographic loads over a broader region. 

2.7.2d The Airy-Heiskanen model 

According to the Airy-Heiskanen model of isostatic 
compensation (Fig. 2.64a) an upper layer of the Earth 
“floats” on a denser magma-like substratum, just as ice¬ 
bergs float in water. The upper layer is equated with the 
crust and the substratum with the mantle. The height of a 
mountain above sea-level is much less than the thickness 
of the crust underneath it, just as the visible tip of an 
iceberg is much smaller than the subsurface part. The 
densities of the crust and mantle are assumed to be con¬ 
stant; the thickness of the root-zone varies in proportion 
to the elevation of the topography. 

The analogy to an iceberg is not exact, because 
under land at sea-level the “normal” crust is already 
about 30-35 km thick; the compensating root-zone of a 
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mountain lies below this depth. Oceanic crust is only 
about 10 km thick, thinner than the “normal” crust. The 
mantle between the base of the oceanic crust and the 
normal crustal depth is sometimes called the anti-root of 
the ocean basin. 

The Airy-Heiskanen model assumes local isostatic 
compensation, i.e., the root-zone of a mountain lies 
directly under it. Isostasy is assumed to be complete, so 
that hydrostatic equilibrium exists at the compensation 
depth, which is equivalent to the base of the deepest 
mountain root. The pressure at this level is due to the 
weight of the rock material in the overlying vertical 
column (of basal area one square meter) extending to the 
Earth’s surface. The vertical column for the mountain of 
height /?j in Fig. 2.64a contains only crustal rocks of 
density p c . The pressure at CC' due to the mountain, 
“normal” crust of thickness t, and a root-zone of thick¬ 
ness ;-| amounts to (h ] + t + r f )p c . The vertical column 
below the “normal” crust contains a thickness t of crustal 
rocks and thickness r j of mantle rocks; it exerts a pressure 
of (tp c + r jp m ). For hydrostatic equilibrium the pressures 
are equal. Equating, and noting that each expression con¬ 
tains the term tp c , we get 



with a similar expression for the root of depth r 2 under 
the hill of height The thickness r () of the anti-root of 
the oceanic crust under an ocean basin of water depth d 
and density p w is given by 


r 


o 



(2.106) 


Beneath a continental region at sea-level the pressure of 
the rock column of density p c is p Q D. Finder an ocean 
basin the pressure at CC' is due to water of depth d and 
density p w on top of a rock column of thickness (D-d) 
and density p 0 ; it is equal to p w d+p 0 (D — d). Equating 
these pressures, we get 

= (2.107) 


for the density below a topographic elevation h p and 


Pc D - Pw d 
p ° D-d 


(2.108) 


for the density under an oceanic basin of depth d. The 
compensation depth D is about 100 km. 

The Pratt-Hayford and Airy-Heiskanen models rep¬ 
resent local isostatic compensation , in which each column 
exerts an equal pressure at the compensation level. At the 
time these models were proposed very little was yet 
known about the internal structure of the Earth. This was 
only deciphered after the development of seismology in 
the late nineteenth and early twentieth century. Each 
model is idealized, both with regard to the density dis¬ 
tributions and the behavior of Earth materials. For 
example, the upper layer is assumed to offer no resistance 
to shear stresses arising from vertical adjustments 
between adjacent columns. Yet the layer has sufficient 
strength to resist stresses due to horizontal differences in 
density. It is implausible that small topographic features 
require compensation at large depths; more likely, they 
are entirely supported by the strength of the Earth’s crust. 


The Airy-Heiskanen model assumes an upper layer of 
constant density floating on a more dense substratum. It 
has root-zones of variable thickness proportional to the 
overlying topography. This scenario agrees broadly with 
seismic evidence for the thickness of the Earth’s crust (see 
Section 3.7). The continental crust is much thicker than 
the oceanic crust. Its thickness is very variable, being 
largest below mountain chains, although the greatest 
thickness is not always under the highest topography. 
Airy-type compensation suggests hydrostatic balance 
between the crust and the mantle. 

2.72.2 The Pratt-Hayford model 

The Pratt-Hayford isostatic model incorporates an outer 
layer of the Earth that rests on a weak magmatic substra¬ 
tum. Differential expansion of the material in vertical 
columns of the outer layer accounts for the surface 
topography, so that the higher the column above a 
common base the lower the mean density of rocks in it. 
The vertical columns have constant density from the 
surface to their base at depth D below sea-level (Fig. 
2.64b). If the rock beneath a mountain of height h t (i= 1, 
2,...) has density p p the pressure at CC' is pfh j + D). 


2.72.3 Vening Meinesz elastic plate model 

In the 1920s F. A. Vening Meinesz made extensive gravity 
surveys at sea. His measurements were made in a subma¬ 
rine to avoid the disturbances of wave motions. He studied 
the relationship between topography and gravity anom¬ 
alies over prominent topographic features, such as the deep 
sea trenches and island arcs in southeastern Asia, and con¬ 
cluded that isostatic compensation is often not entirely 
local. In 1931 he proposed a model of regional isostatic 
compensation which, like the Pratt-Hayford and Airy- 
Heiskanen models, envisages a light upper layer that floats 
on a denser fluid substratum. However, in the Vening 
Meinesz model the upper layer behaves like an elastic plate 
overlying a weak fluid. The strength of the plate distributes 
the load of a surface feature (e.g., an island or seamount) 
over a horizontal distance wider than the feature (Fig. 
2.64c). The topographic load bends the plate downward 
into the fluid substratum, which is pushed aside. The buoy¬ 
ancy of the displaced fluid forces it upward, giving support 
to the bent plate at distances well away from the central 
depression. The bending of the plate which accounts for 
the regional compensation in the Vening Meinesz model 
depends on the elastic properties of the lithosphere. 
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Fig. 2.65 Explanation of the isostatic gravity anomaly (Ag) as the 
difference between the Bouguer gravity anomaly (Ag B ) and the 
computed anomaly (Ag R ) of the root-zone estimated from the 
topography for (a) complete isostatic compensation, (b) isostatic 
overcompensation and (c) isostatic undercompensation. 


2.7.3 Isostatic compensation and vertical crustal 
movements 

In the Pratt-Hayford and Airy-Heiskanen models the 
lighter crust floats freely on the denser mantle. The system 
is in hydrostatic equilibrium, and local isostatic compensa¬ 
tion is a simple application of Archimedes’ principle. A 
“normal” crustal thickness for sea-level coastal regions is 
assumed (usually 30-35 km) and the additional depths of 
the root-zones below this level are exactly proportional to 
the elevations of the topography above sea-level. The 
topography is then completely compensated (Fig. 2.65a). 
However, isostatic compensation is often incomplete. The 
geodynamic imbalance leads to vertical crustal movements. 

Mountains are subject to erosion, which can disturb iso¬ 
static compensation. If the eroded mountains are no longer 
high enough to justify their deep root-zones, the topogra¬ 
phy is isostatically overcompensated (Fig. 2.65b). Buoyancy 
forces are created, just as when a wooden block floating in 
water is pressed downward by a finger; the underwater part 
becomes too large in proportion to the amount above the 
surface. If the finger pressure is removed, the block 
rebounds in order to restore hydrostatic equilibrium. 
Similarly, the buoyancy forces that result from overcom- 



Fig. 2.66 Fennoscandian rates of vertical crustal movement (in 
mm yr _1 ) relative to mean sea-level. Positive rates correspond to uplift, 
negative rates to subsidence (after Kakkuri, 1992). 

pensation of mountainous topography cause vertical uplift. 
The opposite scenario is also possible. When the visible 
topography has roots that are too small, the topography is 
isostatically undercompensated (Fig. 2.65c). This situation 
can result, for example, when tectonic forces thrust crustal 
blocks on top of each other. Hydrostatic equilibrium is 
now achieved by subsidence of the uplifted region. 

The most striking and best-observed examples of verti¬ 
cal crustal movements due to isostatic imbalance are 
related to the phenomenon of glacial rebound observed in 
northern Canada and in Fennoscandia. During the latest 
ice-age these regions were covered by a thick ice-cap. The 
weight of ice depressed the underlying crust. Subsequently 
melting of the ice-cap removed the extra load on the crust, 
and it has since been rebounding. At stations on the 
Fennoscandian shield, modern tide-gauge observations 
and precision levelling surveys made years apart allow the 
present uplift rates to be calculated (Fig. 2.66). The contour 
lines of equal uplift rate are inexact over large areas due to 
the incompleteness of data from inaccessible regions. 
Nevertheless, the general pattern of glacial rebound is 
clearly recognizable, with uplift rates of up to 8 mm yr -1 . 

2.7.4 Isostatic gravity anomalies 

The different degrees of isostatic compensation find expres¬ 
sion in gravity anomalies. As explained in Section 2.5.6 the 
free-air gravity anomaly A g F is small near the center of a 
large region that is isostatically compensated; the Bouguer 
anomaly Ag B is strongly negative. Assuming complete iso¬ 
static compensation, the size and shape of the root-zone 











104 Gravity, the figure of the Earth and geodynamics 


Fig. 2.67 Isostatic gravity 
anomalies in Switzerland 
(after Klingele and Kissling, 
1982), based on the national 
gravity map (Klingele and 
Olivier, 1980), corrected for 
the effects of the Molasse 
basin and the Ivrea body. 



can be determined from the elevations of the topography. 
With a suitable density contrast the gravity anomaly Ag R of 
the modelled root-zone can be calculated; because the root- 
zone has lower density than adjacent mantle rocks Ag R is 
also negative. The isostatic gravity anomaly A g l is defined as 
the difference between the Bouguer gravity anomaly and 
the computed anomaly of the root-zone, i.e., 

Agj = Ag B — Ag R (2.109) 

Examples of the isostatic gravity anomaly for the three 
types of isostatic compensation are shown schematically in 
Fig. 2.65. When isostatic compensation is complete, the 
topography is in hydrostatic equilibrium with its root-zone. 
Both Ag B and Ag R are negative but equal; consequently, 
the isostatic anomaly is everywhere zero (Agj = 0). In the 
case of overcompensation the eroded topography suggests 
a root-zone that is smaller than the real root-zone. The 
Bouguer anomaly is caused by the larger real root, so Ag B 
is numerically larger than Ag R . Subtracting the smaller 
negative anomaly of the computed root-zone leaves a neg¬ 
ative isostatic anomaly (Ag : <0). On the other hand, with 
undercompensation the topography suggests a root-zone 
that is larger than the real root-zone. The Bouguer 
anomaly is caused by the smaller real root, so Ag B is 
numerically smaller than Ag R . Subtracting the larger nega¬ 
tive anomaly of the root-zone leaves a positive isostatic 
anomaly (Ag : >0). 

A national gravity survey of Switzerland carried out in 
the 1970s gave a high-quality map of Bouguer gravity 
anomalies (see Fig. 2.58). Seismic data gave representa¬ 
tive parameters for the Central European crust and 
mantle: a crustal thickness of 32 km without topography, 
and mean densities of 2670 kg m -3 for the topography, 
2810kgm“ 3 for the crust and 3310kgm -3 for the mantle. 
Using the Airy-Heiskanen model of compensation, a 


map of isostatic gravity anomalies in Switzerland was 
derived (Fig. 2.67) after correcting the gravity map for the 
effects of low-density sediments in the Molasse basin 
north of the Alps and high-density material in the anom¬ 
alous Ivrea body in the south. 

The pattern of isostatic anomalies reflects the different 
structures beneath the Jura mountains, which do not have 
a prominent root-zone, and the Alps, which have a low- 
density root that extends to more than 55 km depth in 
places. The dominant ENE-WSW trend of the isostatic 
gravity anomaly contour lines is roughly parallel to the 
trends of the mountain chains. In the northwest, near the 
Jura mountains, positive isostatic anomalies exceed 
20mgal. In the Alps isostatic anomalies are mainly nega¬ 
tive, reaching more than — 50 mgal in the east. 

A computation based on the Veiling Meinesz model 
gave an almost identical isostatic anomaly map. The agree¬ 
ment of maps based on the different concepts of isostasy is 
somewhat surprising. It may imply that vertical crustal 
columns are not free to adjust relative to one another 
without friction as assumed in the Airy-Heiskanen model. 
The friction is thought to result from horizontal compres¬ 
sive stresses in the Alps, which are active in the on-going 
mountain-building process. 

Comparison of the isostatic anomaly map with one of 
recent vertical crustal movements (Fig. 2.68) illustrates the 
relevance of isostatic gravity anomalies for tectonic inter¬ 
pretation. Precise levelling surveys have been carried out 
since the early 1900s along major valleys transecting and 
parallel to the mountainous topography of Switzerland. 
Relative rates of uplift or subsidence are computed from 
the differences between repeated surveys. The results have 
not been tied to absolute tide-gauge observations and so 
are relative to a base station at Aarburg in the canton of 
Aargau, in the northeast. 
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Fig. 2.68 Rates of vertical 
crustal motion in Switzerland 
deduced from repeated 
precise levelling. Broken 
contour lines indicate areas in 
which geodetic data are 
absent or insufficient. Positive 
rates correspond to uplift, 
negative rates to subsidence 
(data source: Gubler, 1991). 



The rates of relative vertical movement in northeastern 
Switzerland are smaller than the confidence limits on the 
data and may not be significant, but the general tendency 
suggests subsidence. This region is characterized by mainly 
positive isostatic anomalies. The rates of vertical move¬ 
ment in the southern part of Switzerland exceed the noise 
level of the measurements and are significant. The most 
notable characteristic of the recent crustal motions is verti¬ 
cal uplift of the Alpine part of Switzerland relative to the 
central plateau and Jura mountains. The Alpine uplift 
rates are up to 1.5 mm yr _1 , considerably smaller than the 
rates observed in Fennoscandia. The most rapid uplift 
rates are observed in the region where isostatic anomalies 
are negative. The constant erosion of the mountain topog¬ 
raphy relieves the crustal load and the isostatic response is 
uplift. However, the interpretation is complicated by the 
fact that compressive stresses throughout the Alpine 
region acting on deep-reaching faults can produce non¬ 
isostatic uplift of the surface. The separation of isostatic 
and non-isostatic vertical crustal movements in the Alps 
will require detailed and exact information about the struc¬ 
ture of the lithosphere and asthenosphere in this region. 

2.8 RHEOLOGY 

2.8.1 Brittle and ductile deformation 

Rheology is the science of the deformation and flow of 
solid materials. This definition appears at first sight to 
contradict itself. A solid is made up of particles that 
cohere to each other; it is rigid and resists a change of 
shape. A fluid has no rigidity; its particles can move about 
comparatively freely. So how can a solid flow? In fact, the 
way in which a solid reacts to stress depends on how large 
the stress is and the length of time for which it is applied. 


Provided the applied stress does not exceed the yield stress 
(or elastic limit) the short-term behavior is elastic. This 
means that any deformation caused by the stress is com¬ 
pletely recoverable when the stress is removed, leaving no 
permanent change in shape. However, if the applied stress 
exceeds the yield stress, the solid may experience either 
brittle or ductile deformation. 

Brittle deformation consists of rupture without other 
distortion. This is an abrupt process that causes faulting 
in rocks and earthquakes, accompanied by the release of 
elastic energy in the form of seismic waves. Brittle frac¬ 
ture occurs at much lower stresses than the intrinsic 
strength of a crystal lattice. This is attributed to the pres¬ 
ence of cracks, which modify the local internal stress field 
in the crystal. Fracture occurs under either extension or 
shear. Extensional fracture occurs on a plane at right 
angles to the direction of maximum tension. Shear frac¬ 
ture occurs under compression on one of two comple¬ 
mentary planes which, reflecting the influence of internal 
friction, are inclined at an angle of less than 45° (typically 
about 30°) to the maximum principal compression. Brittle 
deformation is the main mechanism in tectonic processes 
that involve the uppermost 5-10 km of the lithosphere. 

Ductile deformation is a slow process in which a solid 
acquires strain (i.e., it changes shape) over a long period 
of time. A material may react differently to a stress that is 
applied briefly than to a stress of long duration. If it 
experiences a large stress for a long period of time a solid 
can slowly and permanently change shape. The time- 
dependent deformation is called plastic flow and the 
capacity of the solid to flow is called its ductility. The duc¬ 
tility of a solid above its yield stress depends on tempera¬ 
ture and confining pressure, and materials that are brittle 
under ordinary conditions may be ductile at high temper¬ 
ature and pressure. The behavior of rocks and minerals in 
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(b) continental lithosphere 
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Fig. 2.69 Hypothetical vertical profiles of rigidity in (a) oceanic 
lithosphere and (b) continental lithosphere with the estimated depths of 
brittle-ductile transitions (after Molnar, 1988). 


the deep interior of the Earth is characterized by ductile 
deformation. 

The transition from brittle to ductile types of deforma¬ 
tion is thought to occur differently in oceanic and conti¬ 
nental lithosphere (Fig. 2.69). The depth of the transition 
depends on several parameters, including the composition 
of the rocks, the local geothermal gradient, initial crustal 
thickness and the strain rate. Consequently it is sensitive to 
the vertically layered structure of the lithosphere. The 
oceanic lithosphere has a thin crust and shows a gradual 
increase in strength with depth, reaching a maximum in the 
upper mantle at about 3(M0 km depth. At greater depths 
the lithosphere gradually becomes more ductile, eventually 
grading into the low-rigidity asthenosphere below about 
100 km depth. The continental crust is much thicker than 
the oceanic crust and has a more complex layering. The 
upper crust is brittle, but the minerals of the lower crust are 
weakened by high temperature. As a result the lower crust 
becomes ductile near to the Moho at about 30-35 km 
depth. In the upper mantle the strength increases again, 
leading to a second brittle-ductile transition at about 
40-50 km depth. The difference in rheological layering of 
continental and oceanic lithosphere is important in colli¬ 
sions between plates. The crustal part of the continental 
lithosphere may detach from the mantle. The folding, 
underthrusting and stacking of crustal layers produce 
folded mountain ranges in the suture zone and thickening 
of the continental crust. For example, great crustal thick¬ 
nesses under the Himalayas are attributed to underthrust¬ 
ing of crust from the Indian plate beneath the crust of the 
Eurasian plate. 

2.8.2 Viscous flow in liquids 

Consider the case when a liquid or gas flows in thin layers 
parallel to a flat surface (Fig. 2.70). The laminar flow 
exists as long as the speed stays below a critical value, 


Z 



? horizontal surface 


Fig. 2.70 Schematic representation of laminar flow of a fluid in 
infinitesimally thin layers parallel to a horizontal surface. 


above which the flow becomes turbulent. Turbulent flow 
does not interest us here, because the rates of flow in solid 
earth materials are very slow. 

Suppose that the velocity of laminar flow along the hor¬ 
izontal .v-direction increases with vertical height z above 
the reference surface. The molecules of the fluid may be 
regarded as having two components of velocity. One com¬ 
ponent is the velocity of flow in the .v-direction, but in 
addition there is a random component with a variable 
velocity whose root-mean-square value is determined by 
the temperature (Section 4.2.2). Because of the random 
component, one-sixth of the molecules in a unit volume 
are moving upward and one-sixth downward on average at 
any time. This causes a transfer of molecules between adja¬ 
cent layers in the laminar flow. The number of transfers per 
second depends on the size of the random velocity compo¬ 
nent, i.e., on temperature. The influx of molecules from the 
slower-moving layer reduces the momentum of the faster- 
moving layer. In turn, molecules transferred downward 
from the faster-velocity layer increase the momentum of 
the slower-velocity layer. This means that the two layers do 
not move freely past each other. They exert a shear force - 
or drag - on each other and the fluid is said to be viscous. 

The magnitude of the shear force F xz (Section 3.2) 
depends on how much momentum is transferred from 
one layer to the next. If all the molecules in a fluid have 
the same mass, the momentum transfer is determined by 
the change in the velocity v x between the layers; this 
depends on the vertical gradient of the flow velocity 
(dv v /d~). The momentum exchange depends also on the 
number of molecules that cross the boundary between 
adjacent layers and so is proportional to the surface area, 
A. We can bring these observations together, as did 
Newton in the seventeenth century, and derive the follow¬ 
ing proportionality relationship for F xz : 

dv 

^ d ; (2.H0) 


If we divide both sides by the area A, the left side becomes 
the shear stress <r xz . Introducing a proportionality con¬ 
stant 17 we get the equation 


dv x 

^ =rl dz 


( 2 . 111 ) 
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This equation is Newton’s law of viscous flow and 77 is the 
coefficient of viscosity. If 77 is constant, the fluid is called a 
Newtonian fluid. The value of 17 depends on the transfer 
rate of molecules between layers and so on temperature. 
Substituting the units of stress (pascal) and velocity gradi¬ 
ent ((ms -1 )/ m = s -1 ) we find that the unit of 17 is a pascal- 
second (Pa s). A shear stress applied to a fluid with low 
viscosity causes a large velocity gradient; the fluid flows 
easily. This is the case in a gas (77 in air is of the order 2 X 
10 -5 Pa s) or in a liquid (77 in water is 1.005 X 10 -3 Pa s at 
20 °C). The same shear stress applied to a very viscous fluid 
(with a large value of 77 ) produces only a small velocity gra¬ 
dient transverse to the flow direction. The layers in the 
laminar flow are reluctant to move past each other. The 
viscous fluid is “sticky” and it resists flow. For example, 77 
in a viscous liquid like engine oil is around 0.1-10 Pa s, 
three or four orders of magnitude higher than in water. 

2.8.3 Flow in solids 

The response of a solid to an applied load depends upon 
whether the stress exceeds the elastic limit (Section 3.2.1) 
and for how long it is applied. When the yield stress 
(elastic limit) is reached, a solid may deform continuously 
without further increase in stress. This is called plastic 
deformation. In perfectly plastic behavior the stress- 
strain curve has zero slope, but the stress-strain curves of 
plastically deformed materials usually have a small posi¬ 
tive slope (see Fig. 3.2a). This means that the stress must 
be increased above the yield stress for plastic deformation 
to advance. This effect is called strain-hardening. When 
the stress is removed after a material has been strain- 
hardened, a permanent residual strain is left. 

Consider the effects that ensue if a stress is suddenly 
applied to a material at time t 0 , held constant until time t { 
and then abruptly removed (Fig. 2.71a). As long as the 
applied stress is lower than the yield stress, the solid 
deforms elastically. The elastic strain is acquired immedi¬ 
ately and remains constant as long as the stress is applied. 
Upon removal of the stress, the object at once recovers its 
original shape and there is no permanent strain (Fig. 
2.71b). 

If a constant load greater than the yield stress is 
applied, the resulting strain consists of a constant elastic 
strain and a changing plastic strain, which increases with 
time. After removal of the load at t { the plastic deforma¬ 
tion does not disappear but leaves a permanent strain 
(Fig. 2.71c). In some plastic materials the deformation 
increases slowly at a decreasing rate, eventually reaching a 
limiting value for any specific value of the stress. This is 
called viscoelastic deformation (Fig. 2.71d). When the 
load is removed at t v the elastic part of the deformation is 
at once restored, followed by a slow decrease of the resid¬ 
ual strain. This phase is called recovery or delayed elastic¬ 
ity. Viscoelastic behavior is an important rheological 
process deep in the Earth, for example in the asthenos- 
phere and deeper mantle. 
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Fig. 2.71 (a) Application of a constant stress a to a solid between times 
f 0 and f,; (b) variation of elastic strain below the yield point; (c) plastic 
strain and (d) viscoelastic deformation at stresses above the yield 
point (jy. 


The analogy to the viscosity of liquids is apparent by 
inspection of Eq. (2.111). Putting v x = dxldt and chang¬ 
ing the order of differentiation, the equation becomes 


d dx _ d d.v 
°y- ^d:dt ^ dtd: 


= 7 'dt e ™ 


( 2 . 112 ) 


This equation resembles the elastic equation for shear 
deformation, which relates stress and strain through the 
shear modulus (see Eq. (3.16)). However, in the case of 
the “viscous flow” of a solid the shear stress depends on 
the strain rate. The parameter 77 for a solid is called the 
viscosity modulus , or dynamic viscosity. It is analogous to 
the viscosity coefficient of a liquid but its value in a solid 
is many orders of magnitude larger. For example, the vis¬ 
cosity of the asthenosphere is estimated to be of the order 
of 10 20 —10 21 Pa s. Plastic flow in solids differs from true 
flow in that it only occurs when the stress exceeds the yield 
stress of the solid. Below this stress the solid does not 
flow. However, internal defects in a metal or crystal can be 
mobilized and reorganized by stresses well below the yield 
stress. As a result the solid may change shape over a long 
period of time. 


2.83.1 Viscoelastic model 

Scientists have tried to understand the behavior of rocks 
under stress by devising models based on mechanical 
analogs. In 1890 Lord Kelvin modelled viscoelastic defor¬ 
mation by combining the characteristics of a perfectly 
elastic solid and a viscous liquid. An applied stress causes 
both elastic and viscous effects. If the elastic strain is e, 
the corresponding elastic part of the stress is Es, where E 
is Young’s modulus (Section 3.2.4). Similarly, if the rate 
of change of strain with time is de/dr, the viscous part of 
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the applied stress is 17 de/dr, where 17 is the viscosity 
modulus. The applied stress cr is the sum of the two parts 
and can be written 

ar = Es + i 7 ^ (2.113) 


To solve this equation we first divide throughout by E, 
then define the retardation time r = rilE, which is a measure 
of how long it takes for viscous strains to exceed elastic 
strains. Substituting and rearranging the equation we get 


e 

e 

T 


de cr 

+ T d7 = E 

. de _ e m 

dr “ T 


(2.114) 

(2.115) 



where e m = alE. Multiplying throughout by the integrat¬ 
ing factor e ,/T gives 

fe ,/T + ^e' /T = ^e ,/T (2.116) 

^(ee' /T )=^e" T (2.117) 

Integrating both sides of this equation with respect to t 
gives 

ee' /T = e m e' /T + C (2.118) 

where C is a constant of integration determined by the 
boundary conditions. Initially, the strain is zero, i.e., at 
r = 0, 8 = 0; substituting in Eq. (2.118) gives C=e m . The 
solution for the strain at time t is therefore 



Fig. 2.72 Hypothetical strain-time curve for a material exhibiting creep 
under constant stress, and (b) model creep curve that combines elastic, 
viscoelastic and viscous elements (after Ramsay, 1967). 


e = e m (l — e -,/T ) (2.119) 

The strain rises exponentially to a limiting value given by 
e m = <t/E. This is characteristic of viscoelastic deformation. 

2.8.4 Creep 

Many solid materials deform slowly at room temperature 
when subjected to small stresses well below their brittle 
strength for long periods of time. The slow time- 
dependent deformation is known as creep. This is an 
important mechanism in the deformation of rocks 
because of the great intervals of time involved in geologi¬ 
cal processes. It is hard enough to approximate the condi¬ 
tions of pressure and temperature in the real Earth, but 
the time factor is an added difficulty in investigating the 
phenomenon of creep in laboratory experiments. 

The results of observations on rock samples loaded by a 
constant stress typically show three main regimes of creep 
(Fig. 2.72a). At first, the rock at once strains elastically. 
This is followed by a stage in which the strain initially 
increases rapidly (i.e., the strain rate is high) and then levels 
off. This stage is known as primary creep or delayed elastic 
creep. If the stress is removed within this regime, the defor¬ 
mation drops rapidly as the elastic strain recovers, leaving a 


deformation that sinks progressively to zero. Beyond the 
primary stage creep progresses at a slower and nearly con¬ 
stant rate. This is called secondary creep or steady-state 
creep. The rock deforms plastically, so that if the stress is 
removed a permanent strain is left after the elastic and 
delayed elastic recoveries. After the secondary stage the 
strain rate increases ever more rapidly in a stage called ter¬ 
tiary creep, which eventually leads to failure. 

The primary and secondary stages of the creep curve 
can be modelled by combining elastic, viscoelastic and 
viscous elements (Fig. 2.72b). Below the yield stress only 
elastic and delayed elastic (viscoelastic) deformation 
occur and the solid does not flow. The strain flattens off at 
a limiting value e m = crlE. The viscous component of 
strain rises linearly with time, corresponding to a constant 
strain rate. In practice the stress must exceed the yield 
stress <t for flow to occur. In this case the viscous compo¬ 
nent of strain is proportional to the excess stress (a — a y ) 
and to the time t. Combining terms gives the expression 

o- , (cr~a) 

8 = f+8 m (l-e-^)+- y^t (2.120) 

This simple model explains the main features of experi¬ 
mentally observed creep curves. It is in fact very difficult to 
ensure that laboratory observations are representative of 
creep in the Earth. Conditions of pressure and temperature 
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Fig. 2.73 Permanent (plastic) 
shear deformation produced 
by motion of a dislocation 
through a crystal in response 
to shear stress: 

(a) undeformed crystal lattice, 

(b) entry of dislocation at left 
edge, (c) accommodation of 
dislocation into lattice, (d) 
passage of dislocation across 
the crystal, and (e) sheared 
lattice after dislocation leaves 
the crystal. 



glide 

plane 






in the crust and upper mantle can be achieved or approxi¬ 
mated. The major problems arise from the differences in 
timescales and creep rates. Even creep experiments con¬ 
ducted over months or years are far shorter than the 
lengths of time in a geological process. The creep rates in 
nature (e.g., around 10 -14 s -1 ) are many orders of magni¬ 
tude slower than the slowest strain rate used in laboratory 
experiments (around 10 -8 s _1 ). Nevertheless, the experi¬ 
ments have provided a better understanding of the rheol¬ 
ogy of the Earth's interior and the physical mechanisms 
active in different depths. 

2.84.1 Crystal defects 

Deformation in solids does not take place homogeneously. 
Laboratory observations on metals and minerals have 
shown that crystal defects play an important role. The 
atoms in a metal or crystal are arranged regularly to form a 
lattice with a simple symmetry. In some common arrange¬ 
ments the atoms are located at the corners of a cube or a 
hexagonal prism, defining a unit cell of the crystal. The 
lattice is formed by stacking unit cells together. 
Occasionally an imperfect cell may lack an atom. The 
space of the missing atom is called a vacancy. Vacancies 
may be distributed throughout the crystal lattice, but they 
can also form long chains called dislocations. 

There are several types of dislocation, the simplest being 
an edge dislocation. It is formed when an extra plane of 
atoms is introduced in the lattice (Fig. 2.73). The edge dis¬ 
location terminates at a plane perpendicular to it called the 
glide plane. It clearly makes a difference if the extra plane of 
atoms is above or below the glide plane, so edge disloca¬ 
tions have a sign. This can be represented by a T-shape, 
where the cross-bar of the T is parallel to the glide plane 
and the stalk is the extra plane of atoms. If oppositely 
signed edge dislocations meet, they form a complete plane 
of atoms and the dislocations annihilate each other. The 
displacement of atoms in the vicinity of a dislocation 
increases the local internal stress in the crystal. As a result, 


the application of a small external stress may be enough to 
mobilize the dislocation, causing it to glide through the 
undisturbed part of the crystal (Fig. 2.73). If the disloca¬ 
tion glide is not blocked by an obstacle, the dislocation 
migrates out of the crystal, leaving a shear deformation. 

Another common type of dislocation is the screw dis¬ 
location. It also is made up of atoms that are displaced 
from their regular positions, in this case forming a spiral 
about an axis. 

The deformation of a crystal lattice by dislocation 
glide requires a shear stress; hydrostatic pressure does not 
cause plastic deformation. The shear stress needed to 
actuate dislocations is two or three orders of magnitude 
less than the shear stress needed to break the bonds 
between layers of atoms in a crystal. Hence, the mobiliza¬ 
tion of dislocations is an important mechanism in plastic 
deformation at low stress. As deformation progresses it is 
accompanied by an increase in the dislocation density 
(the number of dislocations per unit area normal to their 
lengths). The dislocations move along a glide plane until 
it intersects the glide plane of another set of dislocations. 
When several sets of dislocations are mobilized they may 
interfere and block each other, so that the stress must be 
increased to mobilize them further. This is manifest as 
strain-hardening, which is a thermodynamically unstable 
situation. At any stage of strain-hardening, given enough 
time, the dislocations redistribute themselves to a config¬ 
uration with lower energy, thereby reducing the strain. 
The time-dependent strain relaxation is called recovery. 

Recovery can take place by several processes, each of 
which requires thermal energy. These include the annihi¬ 
lation of oppositely signed edge dislocations moving on 
parallel glide planes (Fig. 2.74a) and the climb of edge 
dislocations past obstacles against which they have piled 
up (Fig. 2.74b). Edge dislocations with the same sign may 
align to form walls between domains of a crystal that 
have low dislocation density (Fig. 2.74c), a process called 
polygonization. These are some of the ways in which 
thermal energy promotes the migration of lattice defects, 
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Fig. 2.74 Thermally activated processes that assist recovery: (a) 
annihilation of oppositely signed edge dislocations moving on parallel 
glide planes, (b) climb of edge dislocations past obstacles, and (c) 
polygonization by the alignment of edge dislocations with the same 
sign to form walls separating regions with low dislocation density 
(after Ranalli, 1987). 


eventually driving them out of the crystal and leaving an 
annealed lattice. 

2.84.2 Creep mechanisms in the Earth 

Ductile flow in the Earth's crust and mantle takes place by 
one of three mechanisms: low-temperature plastic flow; 
power-law creep; or diffusion creep. Each mechanism is a 
thermally activated process. This means that the strain rate 
depends on the temperature T according to an exponential 
function with the form Q~ E a! kT . Here k is Boltzmann's con¬ 
stant, while E, d is the energy needed to activate the type of 
flow; it is called the activation energy. At low temperatures, 
where T<EJk , the strain rate is very slow and creep is 
insignificant. Because of the exponential function, the 
strain rate increases rapidly with increasing temperature 
above T=EJk. The type of flow at a particular depth 
depends on the local temperature and its relationship to 
the melting temperature T mp . Above T the interatomic 
bonds in the solid break down and it flows as a true liquid. 

Plastic flow at low temperature takes place by the 
motion of dislocations on glide planes. When the disloca¬ 
tions encounter an internal obstacle or a crystal bound¬ 
ary they pile up and some rearrangement is necessary. 


The stress must be increased to overcome the obstacle and 
reactuate dislocation glide. Plastic flow can produce large 
strains and may be an important mechanism in the 
bending of the oceanic lithosphere near some subduction 
zones. It is likely to be most effective at depths below the 
brittle-ductile transition. 

Power-law creep, or hot creep, also takes place by the 
motion of dislocations on glide planes. It occurs at higher 
temperatures than low-temperature plastic flow, so inter¬ 
nal obstacles to dislocation migration are thermally acti¬ 
vated and diffuse out of the crystal as soon as they arise. 
The strain rate in power-law creep is proportional to the 
«th power of the stress <x and has the form 

f t =A (^)"e-W (2 ' 121) 

where p is the rigidity modulus and A is a constant with the 
dimensions of strain rate; typically n>3. This relationship 
means that the strain rate increases much more rapidly 
than the stress. From experiments on metals, power-law 
creep is understood to be the most important mechanism 
of flow at temperatures between 0.55T mp and 0.85r mp . The 
temperature throughout most of the mantle probably 
exceeds half the melting point, so power-law creep is prob¬ 
ably the flow mechanism that permits mantle convection. 
It is also likely to be the main form of deformation in the 
lower lithosphere, where the relationship of temperature to 
melting point is also suitable. 

Diffusion creep consists of the thermally activated 
migration of crystal defects in the presence of a stress field. 
There are two main forms. Nabarro-Herring creep consists 
of diffusion of defects through the body of a grain; Coble 
creep takes place by migration of the defects along grain 
boundaries. In each case the strain rate is proportional to 
the stress, as in a Newtonian fluid. It is therefore possible to 
regard ductile deformation by diffusion creep as the slow 
flow of a very viscous fluid. Diffusion creep has been 
observed in metals at temperatures r>0.85r mp . In the 
Earth’s mantle the temperature approaches the melting 
point in the asthenosphere. As indicated schematically in 
Fig. 2.69 the transition from the rigid lithosphere to the 
soft, viscous underlying asthenosphere is gradational. 
There is no abrupt boundary, but the concept of rigid 
lithospheric plates moving on the soft, viscous asthenos¬ 
phere serves well as a geodynamic model. 

2.8.5 Rigidity of the lithosphere 

Lithospheric plates are thin compared to their horizontal 
extents. However, they evidently react rigidly to the forces 
that propel them. The lithosphere does not easily buckle 
under horizontal stress. A simple analogy may be made to 
a thin sheet of paper resting on a flat pillow. If pushed on 
one edge, the page simply slides across the pillow without 
crumpling. Only if the leading edge encounters an obsta¬ 
cle does the page bend, buckling upward some distance in 
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front of the hindrance, while the leading edge tries to 
burrow under it. This is what happens when an oceanic 
lithospheric plate collides with another plate. A small 
forebulge develops on the oceanic plate and the leading 
edge bends downward into the mantle, forming a subduc- 
tion zone. 

The ability to bend is a measure of the rigidity of the 
plate. This is also manifest in its reaction to a local verti¬ 
cal load. If, in our analogy, a small weight is placed in the 
middle of the page, it is pressed down into the soft pillow. 
A large area around the weight takes part in this process, 
which may be compared with the Vening Meinesz type of 
regional isostatic compensation (Section 2.123). The 
weight of a seamount or chain of islands has a similar 
effect on the oceanic lithosphere. By studying the flexure 
due to local vertical loads, information is obtained about 
a static property of the lithosphere, namely its resistance 
to bending. 

In our analogy the locally loaded paper sheet would 
not bend if it lay on a hard flat table. It is only able to flex 
if it rests on a soft, yielding surface. After the weight is 
removed, the page is restored to its original flat shape. The 
restoration after unloading is a measure of the properties 
of the pillow as well as the page. A natural example is the 
rebound of regions (such as the Canadian shield or 
Fennoscandia) that have been depressed by now-vanished 
ice-sheets. The analysis of the rates of glacial rebound 
provides information about a dynamic property of the 
mantle beneath the lithosphere. The depression of the 
surface forces mantle material to flow away laterally to 
make way for it; when the load is removed, the return 
mantle flow presses the concavity back upward. The ease 
with which the mantle material flows is described by its 
dynamic viscosity. 

The resistance to bending of a thin elastic plate overly¬ 
ing a weak fluid is expressed by an elastic parameter 
called the flexural rigidity and denoted D. For a plate of 
thickness h, 




( 2 . 122 ) 


where E is Young’s modulus and v is Poisson’s ratio (see 
Sections 3.2.3 and 3.2.4 for the definition of these elastic 
constants). The dimensions of E are Nm~ 2 and v is 
dimensionless; hence, the dimensions of D are those of a 
bending moment (N m). D is a fundamental parameter of 
the elastic plate, which describes how easily it can be bent; 
a large value of D corresponds to a stiff plate. 

Here we consider two situations of particular interest 
for the rigidity of the oceanic lithosphere. The first is the 
bending of the lithosphere by a topographic feature such 
as an oceanic island or seamount; only the vertical load 
on the elastic plate is important. The second is the 
bending of the lithosphere at a subduction zone. In this 
case vertical and horizontal forces are located along the 
edge of the plate and the plate experiences a bending 
moment which deflects it downward. 




Fig. 2.75 (a) Geometry for the elastic bending of a thin plate of 
thickness h supported by a denser substratum: the surface load L causes 
a downward bending w. (b) Comparison of 2D and 3D elastic plate 
models. The load is taken to be a topographic feature of density 
2800 kg rrr 3 , height 5 km and cross-sectional width 112km (after 
Watts ef a/., 1975). 


2.8.5.1 Lithospheric flexure caused by oceanic islands 

The theory for elastic bending of the lithosphere is 
derived from the bending of thin elastic plates and beams. 
This involves a fourth-order differential equation, whose 
derivation and solution are beyond the scope of this 
book. However, it is instructive to consider the forces 
involved in setting up the equation, and to examine its 
solution in a simplified context. 

Consider the bending of a thin isotropic elastic plate 
of thickness h carrying a surface load L(x, y) and sup¬ 
ported by a substratum of density p m (Fig. 2.75a). Let 
the deflection of the plate at a position (x, v) relative to 
the center of the load be w(x, v). Two forces act to coun¬ 
teract the downward force of the load. The first, arising 
from Archimedes’ principle, is a buoyant force equal to 
(p m — fflgw, where p i is the density of the material that 
fills in the depression caused by the deflection of the 
plate. The second force arises from the elasticity of the 
beam. Elasticity theory shows that this produces a restor¬ 
ing force proportional to a fourth-order differential of 
the deflection w. Balancing the elastic and buoyancy 
forces against the deforming load leads to the equation 


D 


d 4 W 
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dx 2 dy 2 


l^fj + (p m - Pi)gw = L(x,y) (2-123) 
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For a linear topographic feature, such as a mountain 
range, oceanic island or chain of seamounts, the bending 
geometry is the same in any cross-section normal to its 
length and the problem reduces to the two-dimensional 
elastic bending of a thin beam. If the load is a linear 
feature in the v-direction, the variation of w with y disap¬ 
pears and the differential equation becomes: 

D J^+(Pm-Pi)g™ = L (2-124) 


(a) gravity anomaly 



An important example is a linear load L concentrated 
along the y-axis at x = 0. The solution is a damped sinu¬ 
soidal function: 

w = w 0 e-^“(cosi + sin|) (2.125) 


where vv 0 is the amplitude of the maximum deflection 
underneath the load (at x = 0). The parameter a is called 
the flexural parameter, it is related to the flexural rigidity 
D of the plate by 


a 


4 — 


AD 

( P m ~Pi)g 


(2.126) 


The elasticity of the plate (or beam) distributes the load 
of the surface feature over a large lateral distance. The fluid 
beneath the load is pushed aside by the penetration of the 
plate. The buoyancy of the displaced fluid forces it upward, 
causing uplift of the surface adjacent to the central depres¬ 
sion. Equation (2.125) shows that this effect is repeated 
with increasing distance from the load, the wavelength of 
the fluctuation is A=2Tra. The amplitude of the distur¬ 
bance diminishes rapidly because of the exponential atten¬ 
uation factor. Usually it is only necessary to consider the 
central depression and the first uplifted region. The wave¬ 
length A is equal to the distance across the central depres¬ 
sion. Substituting in Eq. (2.126) gives D, which is then used 
with the parameters E and v in Eq. (2.122) to obtain h, the 
thickness of the elastic plate. The computed values of h are 
greater than the thickness of the crust, i.e., the elastic plate 
includes part of the upper mantle. The value of h is 
equated with the thickness of the elastic lithosphere. 

The difference between the deflection caused by a two- 
dimensional load (i.e., a linear feature) and that due to a 
three-dimensional load (i.e., a feature that has limited 
extent in the x- and v-directions) is illustrated in Fig. 
2.75b. If the length of the three-dimensional load normal 
to the cross-section is more than about ten times its width, 
the deflection is the same as for a two-dimensional load. 
A load with a square base (i.e., extending the same dis¬ 
tance along both x- and y-axes) causes a central depres¬ 
sion that is less than a quarter the effect of the linear load. 

The validity of the lithospheric flexural model of 
isostasy can be tested by comparing the computed gravity 
effect of the model with the observed free-air gravity 
anomaly Ag F . The isostatic compensation of the Great 
Meteor seamount in the North Atlantic provides a suit¬ 
able test for a three-dimensional model (Fig. 2.76). The 


Lat. 30°N 


(b) flexure model Long. 28°W 



Fig. 2.76 (a) Comparison of observed free-air gravity anomaly profile 
across the Great Meteor seamount with the anomaly computed for (b) a 
lithospheric flexure model of isostatic compensation (after Watts ef a/., 
1975). 

shape of the free-air gravity anomaly obtained from 
detailed marine gravity surveys was found to be fitted best 
by a model in which the effective flexural rigidity of the 
deformed plate was assumed to be 6 X 10 22 N m. 

2.8.52 Lithospheric flexure at a subduction zone 

The bathymetry of an oceanic plate at a subduction zone is 
typified by an oceanic trench, which can be many kilome¬ 
ters deep (Fig. 2.77a). Seaward of the trench axis the plate 
develops a small upward bulge (the outer rise) which can 
extend for 100-150 km away from the trench and reach 
heights of several hundred meters. The lithospheric plate 
bends sharply downward in the subduction zone. This 
bending can also be modelled with a thin elastic plate. 

In the model, a horizontal force P pushes a plate of 
thickness h toward the subduction zone, the leading edge 
carries a vertical load L, and the plate is bent by a bending 
moment M(Fig. 2.77b). The horizontal force P is negligi¬ 
ble in comparison to the effects of M and L. The vertical 
deflection of the plate must satisfy Eq. (2.124), with the 
same parameters as in the previous example. Choosing 
the origin to be the point nearest the trench where the 
deflection is zero simplifies the form of the solution, 
which is 
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outer 

accretionary trench outer 

prism slope rise 



rise wavelength 



Fig. 2.77 (a) Schematic structural cross-section at a subduction zone 
(after Caldwell and Turcotte, 1979), and (b) the corresponding thin- 
plate model (afterTurcotte ef a/., 1978). 

w = Ae~ x/a sm§ (2.127) 

where A is a constant and a is the flexural parameter as in 
Eq. (2.126). The value of the constant A is found from the 
position x b of the forebulge, where dw/dx = 0: 




elastic-perfectly 
plastic model 


d w 
dx 
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= 0 


from which 

x b = and A = w b V / 2e’ r/4 


(2.128) 


(2.129) 


It is convenient to normalize the horizontal distance and 
vertical displacement: writing x'=x/x b and w' = w7vv b , 
the generalized equation for the elastic bending at an 
oceanic trench is obtained: 


= V2sin (fx' )exp 


fd-*') 


(2.130) 


The theoretical deflection of oceanic lithosphere at a 
subduction zone obtained from the elastic bending model 
agrees well with the observed bathymetry on profiles 
across oceanic trenches (Fig. 2.78a, b). The calculated 
thicknesses of the elastic lithosphere are of the order 
20-30 km and the flexural rigidity is around 10 23 N m. 
Flowever, at some trenches the assumption of a com¬ 
pletely elastic upper lithosphere is evidently inappropri¬ 
ate. At the Tonga trench the model curve deviates from 
the observed bathymetry inside the trench (Fig. 2.78c). It 
is likely that the elastic limit is exceeded at parts of the 
plate where the curvature is high. These regions may 
yield, leading to a reduction in the effective rigidity of the 
plate. This effect can be taken into account by assuming 
that the inelastic deformation is perfectly plastic. The 
deflection calculated within the trench for an elastic- 
perfectly plastic model agrees well with the observed 
bathymetry. 


Fig. 2.78 Observed (solid) and theoretical (dashed) bathymetric profiles 
for elastic flexure of the lithosphere at (a) the Marianas trench and (b) 
the Kuril trench. The flexure at (c) the Tonga trench is best explained by 
an elastic-perfectly plastic model (after Turcotte ef a/., 1978). 


2.8.53 Thickness of the lithosphere 

The rheological response of a material to stress depends on 
the duration of the stress. The reaction to a short-lasting 
stress, as experienced during the passage of a seismic wave, 
may be quite different from the reaction of the same mater¬ 
ial to a steady load applied for a long period of time. This is 
evident in the different thicknesses obtained for the lithos¬ 
phere in seismic experiments and in elastic plate modelling. 
Long-period surface waves penetrate well into the upper 
mantle. Long wavelengths are slowed down by the low 
rigidity of the asthenosphere, so the dispersion of surface 
waves allows estimates of the seismic thickness of the 
lithosphere. For oceanic lithosphere the seismic thickness 
increases with age of the lithosphere (i.e., with distance 
from the spreading center), increasing to more than 
100 km at ages older than 100 Ma (Fig. 2.79). The lithos¬ 
pheric thicknesses obtained from elastic modelling of the 
bending caused by seamounts and island chains or at sub¬ 
duction zones also increase with distance from the ridge, 
but are only one-third to one-half of the corresponding 
seismic thickness. The discrepancy shows that only the 
upper part of the lithosphere is elastic. Indeed, if the entire 
lithosphere had the flexural rigidity found in elastic models 
(Z>= 10 21 -10 23 N m), it would bend by only small amounts 
under topographic loads or at subduction zones. The base 
of the elastic lithosphere agrees well with the modelled 
depths of the 300-600 °C oceanic isotherms. At greater 
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Age of oceanic lithosphere (Ma) 



Fig. 2.79 Seismic and elastic thicknesses of oceanic lithosphere as a 
function of age (after Watts ef al., 1980). 


depths the increase in temperature results in anelastic 
behavior of the lower lithosphere. 

The elastic thickness of the continental lithosphere is 
much thicker than that of the oceanic lithosphere, except 
in rifts, passive continental margins and young orogenic 
belts. Precambrian shield areas generally have a flexural 
thickness greater than 100 km and a high flexural rigidity 
of around 10 25 N m. Rifts, on the other hand, have a flex¬ 
ural thickness less than 25 km. Both continental and 
oceanic lithosphere grade gradually into the asthenos- 
phere, which has a much lower rigidity and is able to flow 
in a manner determined by mantle viscosity. 

2.8.6 Mantle viscosity 

As illustrated by the transition from brittle to ductile 
behavior (Section 2.8.1), the Earth’s rheology changes 
with depth. The upper part of the lithosphere behaves 
elastically. It has a constant and reversible response to 
both short-term and long-term loads. The behavior is 
characterized by the rigidity or shear modulus p,, which 
relates the strain to the applied shear stress and so has the 
dimensions of stress (N m -2 , or Pa). The resistance of the 
lithosphere to flexure is described by the flexural rigidity 
D, which has the dimensions of a bending moment (N m). 
In a subduction zone the tight bending may locally exceed 
the elastic limit, causing parts of the plate to yield. 

The deeper part of the lithosphere does not behave 
elastically. Although it has an elastic response to abrupt 
stress changes, it reacts to long-lasting stress by ductile 
flow. This kind of rheological behavior also characterizes 
the asthenosphere and the deeper mantle. Flow takes 


Fig. 2.80 (a) Depression of the lithosphere due to a surface load (ice- 
sheet) and accompanying viscous outflow in the underlying mantle; (b) 
return flow in the mantle and surface uplift after removal of the load. 


place with a strain rate that is proportional to the stress or 
a power thereof. In the simplest case, the deformation 
occurs by Newtonian flow governed by a viscosity 
coefficient rj, whose dimensions (Pa s) express the time- 
dependent nature of the process. 

Under a surface load, such as an ice-sheet, the elastic 
lithosphere is pushed down into the viscous mantle (Fig. 
2.80a). This causes an outflow of mantle material away 
from the depressed region. When the ice-sheet melts, 
removing the load, hydrostatic equilibrium is restored and 
there is a return flow of the viscous mantle material (Fig. 
2.80b). Thus, in modelling the time-dependent reaction of 
the Earth to a surface load at least two and usually three 
layers must be taken into account. The top layer is an elastic 
lithosphere up to 100 km thick; it has infinite viscosity (i.e., 
it does not flow) and a flexural rigidity around 5 X 10 24 N m. 
Beneath it lies a low-viscosity “channel” 75-250 km thick 
that includes the asthenosphere, with a low viscosity of typ¬ 
ically 10 19 —10 20 Pa s. The deeper mantle which makes up the 
third layer has a higher viscosity around 10 21 Pa s. 

Restoration of the surface after removal of a load is 
accompanied by uplift, which can be expressed well by a 
simple exponential relaxation equation. If the initial 
depression of the surface is w’ 0 , the deflection \v(t) after 
time t is given by 

w{t) = w 0 e-' /T (2.131) 


Here r is the relaxation time, which is related to the 
mantle viscosity 17 by 



V 


(2.132) 


where p m is the mantle density, g is gravity at the depth of 
the flow and A is the wavelength of the depression, a 
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Fig. 2.81 Uplift in central Fennoscandia since the end of the last ice age 
illustrates exponential viscous relaxation with a time constant of 4400yr 
(after Cathles, 1975). 


dimension appropriate to the scale of the load (as will be 
seen in examples below). A test of these relationships 
requires data that give surface elevations in the past. These 
data come from analyses of sea-level changes, present ele¬ 
vations of previous shorelines and directly observed rates 
of uplift. The ancient horizons have been dated by radio- 
metric methods as well as by sedimentary methods such as 
varve chronology, which consists of counting the annually 
deposited pairs of silt and clay layers in laminated sedi¬ 
ments. A good example of the exponential restoration of a 
depressed region is the history of uplift in central 
Fennoscandia since the end of the last ice age some 
10,000yr ago (Fig. 2.81). If it is assumed that about 30m 
of uplift still remain, the observed uplift agrees well with 
Eq. (2.131) and gives a relaxation time of 4400 yr. 

An important factor in modelling uplift is whether the 
viscous response is caused by the mantle as a whole, or 
whether it is confined to a low-viscosity layer (or 
“channel”) beneath the lithosphere. Seismic shear-wave 
velocities are reduced in a low-velocity channel about 
100-150 km thick, whose thickness and seismic velocity are 
however variable from one locality to another. Interpreted 
as the result of softening or partial melting due to tempera¬ 
tures near the melting point, the seismic low-velocity 
channel is called the asthenosphere. It is not sharply 
bounded, yet it must be represented by a distinct layer in 
viscosity models, which indicate that its viscosity must be 
at least 25 times less than in the deeper mantle. 

The larger the ice-sheet (or other type of surface 
load), the deeper the effects reach into the mantle. By 


studying the uplift following removal of different loads, 
information is obtained about the viscosity at different 
depths in the mantle. 

2.8.6.1 Viscosity of the upper mantle 

About 18,000-20,000 years ago Lake Bonneville, the pre¬ 
decessor of the present Great Salt Lake in Utah, USA, 
had a radius of about 95 km and an estimated depth of 
about 305 m. The water mass depressed the lithosphere, 
which was later uplifted by isostatic restoration after the 
lake drained and dried up. Observations of the present 
heights of ancient shorelines show that the central part of 
the lake has been uplifted by about 65 m. Two parameters 
are involved in the process: the flexural rigidity of the 
lithosphere, and the viscosity of the mantle beneath. The 
elastic response of the lithosphere is estimated from the 
geometry of the depression that would be produced in 
isostatic equilibrium. The maximum flexural rigidity that 
would allow a 65 m deflection under a 305 m water load is 
found to be about 5 X 10 23 N m. 

The surface load can be modelled as a heavy vertical 
right cylinder with radius r, which pushes itself into the 
soft mantle. The underlying viscous material is forced 
aside so that the central depression is surrounded by a cir¬ 
cular uplifted “bulge.” After removal of the load, restora¬ 
tive uplift takes place in the central depression, the 
peripheral bulge subsides and the contour of zero-uplift 
migrates outward. The wavelength A of the depression 
caused by a load with this geometry has been found to be 
about 2.6 times the diameter of the cylindrical load. In 
the case of Lake Bonneville 2 r= 192 km, so A is about 
500 km. The mantle viscosity is obtained by assuming 
that the response time of the lithosphere was short 
enough to track the loading history quite closely. This 
implies that the viscous relaxation time r must have been 
4000 yr or less. Substitution of these values for A and r in 
Eq. (2.132) suggests a maximum mantle viscosity r; of 
about 2X 10 20 Pas in the top 250km of the mantle. A 
lower value of rj would require a thinner low-viscosity 
channel beneath the lithosphere. 

The Fennoscandian uplift can be treated in the same 
way (Fig. 2.82), but the weight and lateral expanse of the 
load were much larger. The ice-cap is estimated to have 
been about 1100 m thick and to have covered most of 
Norway, Sweden and Finland. Although the load was 
therefore somewhat elongate, it is possible to model it sat¬ 
isfactorily by a vertical cylinder of radius r= 550 km cen¬ 
tered on the northern Gulf of Bothnia (Fig. 2.83). The 
load was applied for about 20,000 yr before being removed 
10,000 yr ago. This caused an initial depression of about 
300 m, which has subsequently been relaxing (Fig. 2.82). 
The uplift rates allow upper-mantle viscosities to be esti¬ 
mated. The data are compatible with different models of 
mantle structure, two of which are compared in Fig. 2.83. 
Each model has an elastic lithosphere with a flexural rigid¬ 
ity of 5 X 10 24 N m underlain by a low-viscosity channel 
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uplift 


Fig. 2.82 Model calculations of the relaxation of the deformation 
caused by the Fennoscandian ice-sheet following its disappearance 
10,000yr ago (after Cathles, 1975). 


Model 1: 


Model 2: 


lithosphere: D = 5 x 10 24 N m 
low viscosity channel: 

thickness = 75 km 
i) = 4x 10 19 Pas 
mantle: 77 = 10 21 Pa s 


lithosphere: D = 5 x 10 24 N m 
low viscosity channel: 
thickness = 100 km 
)) = 1.3 x to 19 Pas 



Fig. 2.83 Comparison of Fennoscandian uplift rates interpreted along 
profile AB (inset) with uplift rates calculated for two different models of 
mantle viscosity, assuming the ice-sheet can be represented by a vertical 
right cylindrical load centered on the northern Gulf of Bothnia (after 
Cathles, 1975). 

and the rest of the mantle. The first model has a 75 km 
thick channel (t} = 4x 10 19 Pas) over a viscous mantle 
(r) = 10 21 Pas). The alternative model has a 100km thick 
channel (viscosity coefficient i7=1.3xl0 19 Pas) over a 



Age(ka) 


Fig. 2.84 Comparison of uplift history in the James Bay area with 
predicted uplifts for various Earth models after disappearance of the 
Wisconsin ice-sheet over North America, represented as a vertical right 
cylindrical load with radius 1650 km as in the inset (after Cathles, 1975). 
For details of model parameters see Table 2.2. 

rigid mantle. Both models yield uplift rates that agree 
quite well with the observed uplift rates. However, results 
from North America indicate that the lower mantle is not 
rigid and so the first model fits the data better. 

2.8.62 Viscosity of the lower mantle 

Geologists have developed a coherent picture of the last 
glacial stage, the Wisconsin, during which much of North 
America was covered by an ice-sheet over 3500 m thick. 
The ice persisted for about 20,000 yr and melted about 
10,000 yr ago. It caused a surface depression of about 
600 m. The subsequent history of uplift in the Janies Bay 
area near the center of the feature has been reconstructed 
using geological indicators and dated by the radiocarbon 
method (see Section 4.1.4.1). For modelling purposes the 
ice-sheet can be represented as a right cylindrical load 
with radius r= 1650 km (Fig. 2.84, inset). A load as large 
as this affects the deep mantle. The central uplift follow¬ 
ing removal of the load has been calculated for several 
Earth models. Each model has elastic parameters and 
density distribution obtained from seismic velocities, 
including a central dense core. The models differ from 
each other in the number of viscous layers in the mantle 
and the amount by which the density gradient departs 
from the adiabatic gradient (Table 2.2). The curvature of 
the observed uplift curve only fits models in which the vis¬ 
cosity of the lower mantle is around 10 21 Pa s (Fig. 2.84). 
A highly viscous lower mantle (rj = 10 23 Pa s, model 4) is 
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Table 2.2 Parameters of Earth models used in computing 
uplift rates. All models are for an elastic Earth with a dense 
core (after Catliles, 1975) 



Density 

Viscosity 


Model 

gradient 

[10 21 Pa s] 

Depth interval 

1 

adiabatic 

1 

entire mantle 

2 

adiabatic, 
except 335-635 km 

1 

entire mantle 

3 

adiabatic. 

0.1 

0-335 km 


except 335-635 km 

1 

335 km to core 

4 

adiabatic 

1 

0-985 km 



100 

985 km to core 

5 

adiabatic 

1 

0-985 km 



2 

985-2185 km 



3 

2185 km to core 


incompatible with the observed uplift history. The best fit 
is obtained with model 1 or 5. Each has an adiabatic 
density gradient, but model 1 has a uniform lower mantle 
with viscosity 10 21 Pa s, and model 5 has 17 increasing 
from 10 21 Pa s below the lithosphere to 3 X 10 21 Pa s just 
above the core. 

The viscoelastic properties of the Earth's interior 
influence the Earth’s rotation, causing changes in the 
position of the instantaneous rotation axis. The motion 
of the axis is traced by repeated photo-zenith tube 
measurements, in which the zenith is located by pho¬ 
tographing the stars vertically above an observatory. 
Photo-zenith tube measurements reveal systematic move¬ 
ments of the rotation axis relative to the axis of the figure 
(Fig. 2.85). Decomposed into components along the 
Greenwich meridian (X-axis) and the 90°W meridian (Y- 
axis), the polar motion exhibits a fluctuation with cycli¬ 
cally varying amplitude superposed on a linear trend. The 
amplitude modulation has a period of approximately 
seven years and is due to the interference of the 12 -month 
annual wobble and the 14-month Chandler wobble. The 
linear trend represents a slow drift of the pole toward 
northern Canada at a rate of 0.95 degrees per million 
years. It is due to the melting of the Fennoscandian and 
Laurentide ice-sheets. The subsequent uplift constitutes a 
redistribution of mass which causes modifications to the 
Earth’s moments and products of inertia, thus affecting 
the rotation. 

The observed polar drift can be modelled with 
different viscoelastic Earth structures. The models assume 
a 120 km thick elastic lithosphere and take into account 
different viscosities in the layers bounded by the seismic 
discontinuities at 400 km and 670 km depths (Section 3.7, 
Table 3.4) and the core-mantle boundary. An Earth that 
is homogeneous below the lithosphere (without a core) is 
found to give imperceptible drift. Inclusion of the core, 
with a densityjump across the core-mantle boundary and 
assuming the mantle viscosity to be around 1 X 10 21 Pa s, 
gives a drift that is perceptible but much slower than that 


( Y-coordinate j 



Fig. 2.85 Changes in the position of the instantaneous rotation axis 
from 1900 to 1975 relative to axes defined in the inset (after Peltier, 
1989). 


observed. Introduction of a density change at the 670 km 
discontinuity increases the drift markedly; the 400 km 
discontinuity does not noticeably change the drift further. 
The optimum model has an upper-mantle viscosity of 
about 1 X 10 21 Pa s and a lower-mantle viscosity of about 
3 X 10 21 Pa s. The model satisfies both the rate of drift and 
its direction (Fig. 2.85). The viscosities are comparable to 
values found by modelling post-glacial uplift (Table 2.2, 
model 5). 
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2.10 REVIEW QUESTIONS 

1. Describe the principle of operation of a gravimeter. 

2. Explain why a gravimeter only gives relative measure¬ 
ments of gravity. 

3. What is the geoicf! What is the reference ellipsoid ? 
How and why do they differ? 

4. What is a geoid anomaly? Explain how a positive (or 
negative) anomaly arises. 

5. What is normal gravity ? What does the word normal 
imply? Which surface is involved? 

6 . Gravitational acceleration is directed toward a center 
of mass. With the aid of a sketch that shows the 
directions of gravity’s components, explain why 
gravity is not a centrally directed acceleration. 

7. Write down the general expression for the normal 
gravity formula. Explain which geophysical para¬ 
meters determine each of the constants in the 
formula? 

8 . What is the topographic correction in the reduction 
of gravity data? Why is it needed? 


9. What are the Bouguer plate and free-air gravity cor¬ 
rections? 

10. What is a free-air gravity anomaly? How does it differ 
from a Bouguer anomaly? 

11. Sketch how the Bouguer gravity anomaly might vary 
on a continuous profile that extends from a continen¬ 
tal mountain range to an oceanic ridge. 

12. What is the Coriolis acceleration? How does it origi¬ 
nate? How does it affect wind patterns in the North 
and South hemispheres? 

13. What is the Eotvos gravity correction? When is it 
needed? How does it originate? 

14. Describe two borehole methods for determining the 
density of rocks around the borehole. 

15. Describe and explain the Nettleton profile method to 
determine the optimum density for interpreting a 
gravity survey. 

16. Explain how to calculate the position of the common 
center of mass of the Earth-Moon system. 

17. Explain with the aid of diagrams that show the forces 
involved, why there are two lunar tides per day. 

18. Why are the lunar tides almost equal on oppo¬ 
site sides of the Earth? Why are they not exactly 
equal? 

19. What is isostasy? What is an isostatic gravity 
anomaly? 

20. Why is the free-air gravity anomaly close to zero at 
the middle of a large crustal block that is in isostatic 
equilibrium? 

21. Describe the three models of isostasy, and explain 
how they differ from each other. 

22. What would be the geodynamic behavior of a region 
that is characterized by a negative isostatic gravity 
anomaly? 


2.11 EXERCISES 

1. Using the data in Table 1.1 calculate the gravitational 
acceleration on the surface of the Moon as a percent¬ 
age of that on the surface of the Earth. 

2. An Olympic high-jump champion jumps a record 
height of 2.45 m on the Earth. How high could this 
champion jump on the Moon? 

3. (a) Calculate the escape velocity of an object on the 
Earth, assuming a mean gravitational acceleration of 
9.81 m s -1 and mean Earth radius of 6371 km. 

(b) What is the escape velocity of the same object on 
the Moon? 

4. The equatorial radius of the Earth is 6378 km 
and gravity at the equator is 9.780 m s -2 . Compute 
the ratio m of the centrifugal acceleration at the 
equator to the gravitational acceleration at the 
equator. If the ratio m is written as Ilk, what is 
the value of kl 
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5. Given that the length of a month is 27.32 days, the 
mean gravity on Earth is 9.81 m s -2 and the Earth’s 
radius is 6371 km, calculate the radius of the Moon's 
orbit. 

6 . A communications satellite is to be placed in a geo¬ 
stationary orbit. 

(a) What must the period and orientation of the 
orbit be? 

(b) What is the radius of the orbit? 

(c) If a radio signal is sent to the satellite from a 
transmitter at latitute 45°N, what is the shortest 
time taken for its reflection to reach the Earth? 

7. Calculate the centrifugal acceleration due to the 
Earth’s rotation of an object at rest on the Earth’s 
surface in Paris, assuming a latitude of 48° 52' N. 
Express the result as a percentage of the gravitational 
attraction on the object. 

8 . A solid angle (Q) is defined as the quotient of the area 
(A) of the part of a spherical surface subtended by 
the angle, divided by the square of the spherical 
radius (r): i.e., Q. = Alr 2 (see Box 5.4). Show with the 
aid of a diagram that the gravitational acceleration at 
any point inside a thin homogeneous spherical shell is 
zero. 

9. Assuming that the gravitational acceleration inside a 
homogeneous spherical shell is zero, show that the 
gravitational acceleration inside a homogenous 
uniform solid sphere is proportional to the distance 
from its center. 

10. Show that the gravitational potential U G inside a 
homogenous uniform solid sphere of radius R at a 
distance r from its center is given by 

U G = -^GpO^-r 2 ) 

11. Sketch the variations of gravitational acceleration 
and potential inside and outside a homogeneous 
solid sphere of radius R. 

12. A thin borehole is drilled through the center of the 
Earth, and a ball is dropped into the borehole. Assume 
the Earth to be a homogenous solid sphere. Show that 
the ball will oscillate back and forth from one side of 
the Earth to the other. How long does it take to tra¬ 
verse the Earth and reach the other side? 

13. The Roche limit is the closest distance an object can 
approach a planet before being torn apart by the tidal 
attraction of the planet. For a rigid spherical moon 
the Roche limit is given by Eq. ( 6 ) in Box 2.1. 

(a) Using the planetary dimensions in Table 1.1, cal¬ 
culate the Roche limit for the Moon with respect 
to the Earth. Express the answer as a multiple of 
the Earth’s radius. 

(b) Show that, for a planet whose mean density is less 


than half that of its rigid moon, the moon would 
collide with the planet before being torn apart by 
its gravity. 

(c) Given that the Sun’s mass is 1.989 X 10 30 kg and 
that its radius is 695,500 km, calculate the Roche 
limit for the Earth with respect to the Sun. 

(d) The mean density of a comet is about 500 
kg m -3 . What is the Roche limit for comets that 
might collide with the Earth? 

(e) The mean density of an asteroid is about 2000 
kg m -3 . If an asteroid on collision course with 
the Earth has a velocity of 15 km s -1 , how much 
time will elapse between the break-up of the 
asteroid at the Roche limit and the impact of the 
remnant pieces on the Earth’s surface, assuming 
they maintain the same velocity as the asteroid? 

14. The mass M and moment of inertia C of a thick shell 
of uniform density p, with internal radius r and exter¬ 
nal radius R are given by 

M= j' 7 Tp(J ? 3 — r 3 ) C= y^-rrptR 5 — r 5 ) 


The Earth has an internal structure consisting of con¬ 
centric spherical shells. A simple model with uniform 
density in each shell is given in the following figure. 



(a) Compute the mass and moment of inertia of 
each spherical shell. 

(b) Compute the total mass and total moment of 
inertia of the Earth. 

(c) If the moment of inertia can be written C= 
kMR 2 , where M is Earth's mass and R its radius, 
what is the value of kl 

(d) What would the value of k be if the density were 
uniform throughout the Earth? 

15. By differentiating the normal gravity formula given 
by Eq. (2.56) develop an expression for the change in 
gravity with latitude. Calculate the gravity change in 
milligals per kilometer of northward displacement at 
latitude 45°. 

16. The following gravity measurements were made on a 
traverse across a rock formation. Use the combined 
elevation correction to compute the apparent density 
of the rock. 
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Elevation [m] 

Gravity [mgal] 

100 

-39.2 

150 

-49.5 

235 

-65.6 

300 

-78.1 

385 

-95.0 

430 

-104.2 


17. Show that the "half-width” w of the gravity anomaly 
over a sphere and the depth z to the center of the 
sphere are related by z = 0.652w. 

18. Assume the “thin-sheet approximation” (Eq. (2.101)) 
for the gravity anomaly over a vertical fault of density 
contrast A p and height h with mid-point at depth z Q . 

(a) What is the maximum slope of the anomaly and 
where does it occur? 

(b) Determine the relationship between the depth z Q 
and the horizontal distance w between the posi¬ 
tions where the slope of the anomaly is one-half 
the maximum slope. 

19. Calculate the maximum gravity anomaly at ground 
level over a buried anticlinal structure, modelled by a 
horizontal cylinder with radius 1000 m and density 
contrast 200 kg m -3 , when the depth of the cylinder 
axis is (a) 1500 m and (b) 5000 m. 

20. The peak A of a mountain is 1000 meters above 
the level CD of the surrounding plain, as in the 
diagram. The density of the rocks forming the 
mountain is 2800 kg m -3 , that of the surrounding 
crust is 3000 kg m -3 . Assuming that the mountain 
and its “root” are symmetric about A and that the 
system is in local isostatic equilibrium, calculate the 
depth of B below the level CD. 

A 


21. A crustal block with mean density 3000 kg m -3 is 
initially in isostatic equilibrium with the surrounding 
rocks whose density is 3200 kg m -3 , as in the figure 
(a). After subsequent erosion the above-surface 
topography is as shown in (b). The distance L 
remains constant (i.e. there is no erosion at the 
highest point A) and Airy-type isostatic equilibrium 
is maintained. Calculate in terms of L the amount by 
which the height of A is changed. Explain why A 
moves in the sense given by your answer. 



(a) (b) 


22. An idealized mountain-and-root system, as in the 
figure, is in isostatic equilibrium. The densities in 
kg nr 3 are as shown. Express the height H of the 
point A above the horizontal surface RS in terms of 
the depth D of the root B below this surface. 



B 











3 Seismology and the internal structure of the Earth 


3.1 INTRODUCTION 

Seismology is a venerable science with a long history. The 
Chinese scientist Chang Heng is credited with the inven¬ 
tion in 132 AD, nearly two thousand years ago, of the first 
functional seismoscope, a primitive but ingenious device of 
elegant construction and beautiful design that registered 
the arrival of seismic waves and enabled the observer to 
infer the direction they came from. The origins of earth¬ 
quakes were not at all understood. For centuries these fear¬ 
some events were attributed to supernatural powers. The 
accompanying destruction and loss of life were often 
understood in superstitious terms and interpreted as pun¬ 
ishment inflicted by the gods on a sinful society. Biblical 
mentions of earthquakes - e.g., in the destruction of 
Sodom and Gomorrah - emphasize this vengeful theme. 
Although early astronomers and philosophers sought to 
explain earthquakes as natural phenomena unrelated to 
spiritual factors, the belief that earthquakes were an 
expression of divine anger prevailed until the advent of the 
Age of Reason in the eighteenth century. The path to a 
logical understanding of natural phenomena was laid in 
the seventeenth century by the systematic observations of 
scientists like Galileo, the discovery and statement of phys¬ 
ical laws by Newton and the development of rational 
thought by contemporary philosophers. 

In addition to the development of the techniques of 
scientific observation, an understanding of the laws of 
elasticity and the limited strength of materials was neces¬ 
sary before seismology could progress as a science. In a 
pioneering study Galileo in 1638 described the response 
of a beam to loading, and in 1660 Hooke established the 
law of the spring. However, another 150 years passed 
before the generalized equations of elasticity were set 
down by Navier. During the early decades of the nine¬ 
teenth century Cauchy and Poisson completed the foun¬ 
dations of modern elasticity theory. 

Early descriptions of earthquake characteristics were 
necessarily restricted to observations and measurements 
in the “near-field” region of the earthquake, i.e. in com¬ 
paratively close proximity to the place where it occurred. 
A conspicuous advance in the science of seismology was 
accomplished with the invention of a sensitive and reli¬ 
able seismograph by John Milne in 1892. Although 
massive and primitive by comparison with modern 
instruments, the precision and sensitivity of this revolu¬ 


tionary new device permitted accurate, quantitative 
descriptions of earthquakes at large distances from their 
source, in their “far-field” region. The accumulation of 
reliable records of distant earthquakes (designated as 
“teleseismic” events) made possible the systematic study 
of the Earth’s seismicity and its internal structure. 

The great San Francisco earthquake of 1906 was inten¬ 
sively studied and provided an impetus to efforts at under¬ 
standing the origin of these natural phenomena, which 
were clarified in the same year by the elastic rebound 
model of H. F. Reid. Also, in 1906, R. D. Oldham pro¬ 
posed that the best explanation for the travel-times of tele- 
seismic waves through the body of the Earth required a 
large, dense and probably fluid core; the depth to its outer 
boundary was calculated in 1913 by B. Gutenberg. From 
the analysis of the travel-times of seismic body waves from 
near earthquakes in Yugoslavia, A. Mohoroviqig in 1909 
inferred the existence of the crust-mantle boundary, and 
in 1936 the existence of the solid inner core was deduced 
by I. Lehmann. The definitions of these and other discon¬ 
tinuities associated with the deep internal structure of the 
Earth have since been greatly refined. 

The needs of the world powers to detect incontrovert- 
ibly the testing of nuclear bombs by their adversaries pro¬ 
vided considerable stimulus to the science of seismology in 
the 1950s and 1960s. The amount of energy released in a 
nuclear explosion is comparable to that of an earthquake, 
but the phenomena can be discriminated by analyzing the 
directions of first motion recorded by seismographs. The 
accurate location of the event required improved knowl¬ 
edge of seismic body-wave velocities throughout the 
Earth’s interior. These political necessities of the Cold War 
led to major improvements in seismological instrumenta¬ 
tion, and to the establishment of a new world-wide network 
of seismic stations with the same physical characteristics. 
These developments had an important feedback to the 
earth sciences, because they resulted in more accurate loca¬ 
tion of earthquake epicenters and a better understanding 
of the Earth’s structure. The pattern of global seismicity, 
with its predominant concentration in narrow active zones, 
was an important factor in the development of the theory 
of plate tectonics, as it allowed the identification of plate 
margins and the sense of relative plate motions. 

The techniques of refraction and reflection seismology, 
using artificial, controlled explosions as sources, were devel¬ 
oped in the search for petroleum. Since the 1960s these 
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methods have been applied with notable success to the reso¬ 
lution of detailed crustal structure under continents and 
oceans. The development of powerful computer technology 
enabled refinements in earthquake location and in the 
determination of travel-times of seismic body waves. These 
advances led to the modern field of seismic tomography, a 
powerful and spectacular technique for revealing regions of 
the Earth’s interior that have anomalous seismic velocities. 
In the field of earthquake seismology, the need to protect 
populations and man-made structures has resulted in the 
investment of considerable effort in the study of earthquake 
prediction and the development of construction codes to 
reduce earthquake damage. 

To appreciate how seismologists have unravelled the 
structure of the Earth's interior it is necessary to under¬ 
stand what types of seismic waves can be generated by an 
earthquake or man-made source (such as a controlled 
explosion). The propagation of a seismic disturbance 
through the Earth is governed by physical properties such 
as density, and by the way in which the material of the 
Earth’s interior reacts to the disturbance. Material within 
the seismic source suffers permanent deformation, but 
outside the source the passage of a seismic disturbance 
takes place predominantly by elastic displacement of the 
medium; that is, the medium suffers no permanent defor¬ 
mation. Before analyzing the different kinds of seismic 
waves, it is important to have a good grasp of elementary 
elasticity theory. This requires understanding the con¬ 
cepts of stress and strain, and the various elastic con¬ 
stants that relate them. 

3.2 ELASTICITY THEORY 

3.2.1 Elastic, anelastic and plastic behavior of materials 

When a force is applied to a material, it deforms. This 
means that the particles of the material are displaced 
from their original positions. Provided the force does not 
exceed a critical value, the displacements are reversible; 
the particles of the material return to their original posi¬ 
tions when the force is removed, and no permanent defor¬ 
mation results. This is called elastic behavior. 

The laws of elastic deformation are illustrated by the 
following example. Consider a right cylindrical block of 
height h and cross-sectional area A, subjected to a force 
F which acts to extend the block by the amount Ah 
(Fig. 3.1). Experiments show that for elastic deforma¬ 
tion Ah is directly proportional to the applied force and 
to the unstretched dimension of the block, but is 
inversely proportional to the cross-section of the block. 
That is, AIik Fill A, 0 r 


When the area A becomes infinitesimally small, the 
limiting value of the force per unit area (FIA) is called the 
stress, a. The units of stress are the same as the units of 



Fig. 3.1 A force F acting on a bar with cross-sectional area A extends 
the original length h by the amount A h. Hooke's law of elastic 
deformation states that A h/h is proportional to F/A. 

pressure. The SI unit is the pascal, equivalent to a force of 
1 newton per square meter (1 Pa = 1 N m -2 ); the e.g.s. 
unit is the bar, equal to 106 dyne cm -2 . 

When h is infinitesimally small, the fractional change 
in dimension (A hlh) is called the strain e, which is a 
dimensionless quantity. Equation (3.1) states that, for 
elastic behavior, the strain in a body is proportional to the 
stress applied to it. This linear relationship is called 
Hooke’s law. It forms the basis of elasticity theory. 

Beyond a certain value of the stress, called the propor¬ 
tionality limit, Hooke’s law no longer holds (Fig. 3.2a). 
Although the material is still elastic (it returns to its origi¬ 
nal shape when stress is removed), the stress-strain rela¬ 
tionship is non-linear. If the solid is deformed beyond a 
certain point, known as the elastic limit, it will not recover 
its original shape when stress is removed. In this range a 
small increase in applied stress causes a disproportion¬ 
ately large increase in strain. The deformation is said to be 
plastic. If the applied stress is removed in the plastic 
range, the strain does not return to zero; a permanent 
strain has been produced. Eventually the applied stress 
exceeds the strength of the material and failure occurs. In 
some rocks failure can occur abruptly within the elastic 
range; this is called brittle behavior. 

The non-brittle, or ductile, behavior of materials under 
stress depends on the timescale of the deformation (Fig. 
3.2b). An elastic material deforms immediately upon 
application of a stress and maintains a constant strain 
until the stress is removed, upon which the strain returns 
to its original state. A strain-time plot has a box-like 
shape. However, in some materials the strain does not 
reach a stable value immediately after application of a 
stress, but rises gradually to a stable value. This type of 
strain response is characteristic of anelastic materials. 
After removal of the stress, the time-dependent strain 
returns reversibly to the original level. In plastic deforma¬ 
tion the strain keeps increasing as long as the stress is 
applied. When the stress is removed, the strain does not 
return to the original level; a permanent strain is left in 
the material. 
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Fig. 3.2 (a) The stress-strain relation for a hypothetical solid is linear 
(Hooke's law) until the proportionality limit, and the material deforms 
elastically until it reaches the elastic limit; plastic deformation produces 
further strain until failure occurs, (b) Variations of elastic, anelastic and 
plastic strains with time, during and after application of a stress. 


Our knowledge of the structure and nature of the 
Earth’s interior has been derived in large part from 
studies of seismic waves released by earthquakes. An 
earthquake occurs in the crust or upper mantle when the 
tectonic stress exceeds the local strength of the rocks and 
failure occurs. Away from the region of failure seismic 
waves spread out from an earthquake by elastic deforma¬ 
tion of the rocks through which they travel. Their propa¬ 
gation depends on elastic properties that are described by 
the relationships between stress and strain. 

3.2.2 The stress matrix 

Consider a force F acting on a rectangular prism P in a ref¬ 
erence frame defined by orthogonal Cartesian coordinate 
axes x, y and z (Fig. 3.3a). The component of F which acts 
in the direction of the x-axis is designated F x ; the force F 
is fully defined by its components F x , F y and F_. The size of 
a small surface element is characterized by its area A, 
while its orientation is described by the direction normal 
to the surface (Fig. 3.3b). The small surface with area 
normal to the x-axis is designated A x . The component of 
force F x acting normal to the surface A x produces a 


(a) z (b) z 




(c) 



Fig. 3.3 (a) Components F x , F y and F z of the force F acting in a reference 
frame defined by orthogonal Cartesian coordinate axesx, y andz. (b) 
The orientation of a small surface element with area A x is described by 
the direction normal to the surface, (c) The components of force parallel 
to the x-axis result in the normal stress o^; the components parallel to 
the y- and z-axes cause shear stresses a and <r xz . 


normal stress, denoted by a . The components of force 
along the y- and z-axes result in shear stresses a vx and a zx 
(Fig. 3.3c), given by 



Similarly, the components of the force F acting on an 
element of surface A y normal to the y-axis define a normal 
stress <r and shear stresses <r and tr, y , while the compo¬ 
nents of F acting on an element of surface A~ normal to 
the z-axis define a normal stress o\_ and shear stresses cr xz 
and <r . The nine stress components completely define the 
state of stress of a body. They are described conveniently 
by the stress matrix 


'zy 


(3.3) 


If the forces on a body are balanced to give no rotation, 
this 3x3 matrix is symmetric (i.e., a xy = <r yx , a y , = <j_ y , 
<T zx = aj and contains only six independent elements. 


3.2.3 The strain matrix 
3.23.1 Longitudinal strain 

The strains produced in a body can also be expressed by a 
3x3 matrix. Consider first the one-dimensional case 
shown in Fig. 3.4 of two points in a body located close 
together at the positions x and (x + Ax). If the point x is 
displaced by an infinitesimally small amount u in the 
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(a) 


(b) 


x x + Ax 




< u > F 


y 






< -> 

u + Au 


x + u (x + Ax) + 
(u + An) 


du 

dx 


Ax 


Fig. 3.4 Infinitesimal displacements u and (u + Au) of two points in a 
body that are located close together at the positions x and (x + Ax), 
respectively. 


direction of the x-axis, the point (x + Ax) will be dis¬ 
placed by (h + Aw), where Ah is equal to (duldx)Ax to first 
order. The longitudinal strain or extension in the x-direc- 
tion is the fractional change in length of an element along 
the x-axis. The original separation of the two points was 
Ax; one point was displaced by u, the other by (h + Am), so 
the new separation of the points is (Ax + Ah). The com¬ 
ponent of strain parallel to the x-axis resulting from a 
small displacement parallel to the x-axis is denoted e xx , 
and is given by 


e 


XX 


(Ax + gAx) 


-Ax 


Ax 


dll 

dx 


(3.4) 


The description of longitudinal strain can be ex¬ 
panded to three dimensions. If a point (x, y, z) is dis¬ 
placed by an infinitesimal amount to (x + u, y + v,z+ w), 
two further longitudinal strains e and e„ are defined by 


_ dy 
e yy ~ dy 


and 



(3.5) 


In an elastic body the transverse strains e yy and s zz are 
not independent of the strain e xx . Consider the change of 
shape of the bar in Fig. 3.5. When it is stretched parallel 
to the x-axis, it becomes thinner parallel to the y-axis and 
parallel to the z-axis. The transverse longitudinal strains 
e and e zz are of opposite sign but proportional to the 
extension s xx and can be expressed as 



Poisson's ratio: 

_ _ Ay/V _ _ Syy 

Ax/x £xx 


Fig. 3.5 Change of shape of a rectangular bar under extension. When 
stretched parallel to the x-axis, it becomes thinner parallel to they-axis 
and z-axis. 


32.3.2 Dilatation 


The dilatation 6 is defined as the fractional change in 
volume of an element in the limit when its surface area 
decreases to zero. Consider an undeformed volume element 
(as in the description of longitudinal strain) which has 
sides Ax, Ay and A: and undistorted volume V = Ax Ay A:. 
As a result of the infinitesimal displacements Ah, Av and 
An: the edges increase to Ax + Au, Ay + Av, and A: + Aw, 
respectively. The fractional change in volume is 


AF (Ax + Ah)( Ay + Av)(Az + Aw) — AxAyAz 
V AxAyAz 


AxAyAz + AuAyAz + AvAzAx + AwAxAy — AxAyAz 

AxAyAz 


Au , Ay Aw 

Ax Ay Az 


(3.7) 


where very small quantities like AuAv, AvAw, AwAu and 
AuAvAw have been ignored. In the limit, as Ax, Ay and Az 
all approach zero, we get the dilatation 


n _du . dv . dw 
0 dx dy dz 


«vv ' S yy 


(3.8) 


32.3.3 Shear strain 


The constant of proportionality v is called Poisson’s 
ratio. The values of the elastic constants of a material 
constrain v to lie between 0 (no lateral contraction) and a 
maximum value of 0.5 (no volume change) for an incom¬ 
pressible fluid. In very hard, rigid rocks like granite v is 
about 0.45, while in soft, poorly consolidated sediments it 
is about 0.05. In the interior of the Earth, v commonly 
has a value around 0.24-0.27. A body for which the value 
of v equals 0.25 is sometimes called an ideal Poisson 
body. 


During deformation a body generally experiences not 
only longitudinal strains as described above. The shear 
components of stress (a , ar y „ <j, x ) produce shear strains, 
which are manifest as changes in the angular relation¬ 
ships between parts of a body. This is most easily illus¬ 
trated in two dimensions. Consider a rectangle ABCD 
with sides Ax and Ay and its distortion due to shear 
stresses acting in the x-y plane (Fig. 3.6). As in the earlier 
example of longitudinal strain, the point A is displaced 
parallel to the x-axis by an amount u (Fig. 3.6a). Because 
of the shear deformation, points between A and D experi¬ 
ence larger x-displacements the further they are from A. 
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1 fdu. dnA 
2\dz dx) 


(3.13) 


The longitudinal and shear strains define the symmet¬ 
ric 3 X 3 strain matrix 


S x* 

B xy 

£ xz 

e vx 

B yy 

e v-- 

^zx 

S zy 

e ---- 


(3.14) 


applying Hooke’s law, the stress <r xx produces an exten¬ 
sion equal to cr xx IE in the x-direction. The stress cr yy 
causes an extension cr yy IE in the y-direction, which 
results in an accompanying transverse strain -v(cr yy IE) 
in the x-direction, where v is Poisson's ratio. Similarly, 
the stress component <x zz makes a contribution 
-HtrJE) to the total longitudinal strain e xx in the x- 
direction. Therefore, 





(3.18) 


3.2.4 The elastic constants 


According to Hooke’s law, when a body deforms elastically, 
there is a linear relationship between stress and strain. The 
ratio of stress to strain defines an elastic constant (or elastic 
modulus) of the body. Strain is itself a ratio of lengths and 
therefore dimensionless. Thus the elastic moduli must have 
the units of stress (N m -2 ). The elastic moduli, defined for 
different types of deformation, are Young’s modulus, the 
rigidity modulus and the bulk modulus. 

Young’s modulus is defined from the extensional defor¬ 
mations. Each longitudinal strain is proportional to the 
corresponding stress component, that is, 


= Ee xx 


a yy = Es yy 


= Ee_ 


(3.15) 


where the constant of proportionality, E , is Young’s mod¬ 
ulus. 

The rigidity modulus (or shear modulus ) is defined from 
the shear deformation. Like the longitudinal strains, the 
total shear strain in each plane is proportional to the cor¬ 
responding shear stress component: 

= 2 E<i xy <r yz = 2pe yz a zx = 2pe zx (3.16) 


Similar equations describe the total longitudinal strains 
e and e z _. They can be rearranged as 


Ee xx = < T xx- V<T yy- v<T zz 
E e yy = (Tyy-V , T zz -V<T xx 


(3.19) 


Eb zz = (T z z~ vtT xx~ Wyy 


Adding these three equations together we get 


E ( e xx + B yy + = ( 1 - 2v )( a xx + (J yy + ,r z-J ( 3 - 20 ) 


Consider now the effect of a constraining hydrostatic 
pressure, p, where <j xx = <x yy = cr, z = -p. Using the defini¬ 
tion of dilatation (0) in Eq. (3.8) we get 

Ed=(l-2v)(-3p) 


E={ l-2v)(-3|J 


(3.21) 


from which, using the definition of bulk modulus (K) in 
Eq. (3.17), 


where the proportionality constant, p, is the rigidity 
modulus and the factor 2 arises as explained for Eqs. 
(3.11) and (3.12). 

The hulk modulus (or incompressibility) is defined from 
the dilatation experienced by a body under hydrostatic 
pressure. Shear components of stress are zero for hydro¬ 
static conditions (a —a =a = 0 ), and the inwards 
pressure (negative normal stress) is equal in all directions 

(a xx = <r yv = a _= -p). The bulk modulus, K , is the ratio of 

the hydrostatic pressure to the dilatation, that is, 

p=-K6 (3.17) 

The inverse of the bulk modulus (^f _1 ) is called the 
compressibility. 

3.2.4.1 Bulk modulus in terms of Young's modulus and 
Poisson's ratio 

Consider a rectangular volume element subjected to 
normal stresses cr xx , cr and <r zz on its end surfaces. 
Each longitudinal strain s xx , e yy and e„ results from the 
combined effects of <j xx , cr and cr,,. For example, 


K = 


E 

3(1 - 2v) 


(3.22) 


3.2.4.2 Shear modulus in terms of Young's modulus and 
Poisson's ratio 

The relationship between g and E can be appreciated by 
considering the shear deformation of a rectangular prism 
that is infinitely long in one dimension and has a square 
cross-section in the plane of deformation. The shear causes 
shortening of one diagonal and extension of the other. Let 
the length of the side of the square be a (Fig. 3.7a) and that 
of its diagonal be d 0 (= as[2). The small shear through the 
angle </> displaces one corner by the amount (a tan cf) and 
stretches the diagonal to the new length d (Fig. 3.7b), 
which is given by Pythagoras’ theorem: 

d 2 = a 2 + {a + atan 0) 2 

= a 2 + a 2 + a 2 tan 2 0 + 2 a 2 tan<)> 

= 2a 2 ^ 1 + tan<^> + Aan 2 (/)) = <7g( 1+0) (3.23a) 
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Fig. 3.7 (a) In the undeformed state d 0 is the length of the diagonal 
of a square with side length a. (b) When the square is deformed 
by shear through an angle the diagonal is extended to the new 
length d. 


d ~ 



(3.23b) 


where for an infinitesimally small strain tan <p = (f>, and 
powers of <fi higher than first order are negligibly small. 
The extension of the diagonal is 


A d ^_(4 _ 0 

d 0 d q 2 


(3.24) 


This extension is related to the normal stresses a xx and 
cr in the x-y plane of the cross-section (Fig. 3.8a), which 
are in general unequal. Let p represent their average 
value: p = (a xx + a yv )/2. The change of shape of the 
square cross-section results from the differences A p 
between p and a xx and <r yv , respectively (Fig. 3.8b). The 
outwards stress difference A p along the .v-axis produces 
an x-extension equal to A piE, while the inwards stress 
difference along the j-axis causes contraction along the y- 
axis and a corresponding contribution to the x-extension 
equal to v(AplE), where v is Poisson’s ratio as before. The 
total x-extension Ax/x is therefore given by 

x jrO"*" 1 ') (3.25) 


Let each edge of the square represent an arbitrary area 
A normal to the plane of the figure. The stress differences 
A p produce forces f=Ap A on the edges of the square, 
which resolve to shear forces f/y2 parallel to the sides of 
the inner square defined by joining the mid-points of the 
sides of the original square (Fig. 3.8c). Normal to the 
plane of the figure the surface area represented by each 
inner side is A/\2, and therefore the tangential (shear) 
stress acting on these sides simply equals A p (Fig. 3.8d). 
The inner square shears through an angle 4>, and so we can 
write 

A p — pcf> (3.26) 


One diagonal becomes stretched in the x-direction 
while the other diagonal is shortened in the v-direction. 
The extension of the diagonal of a sheared square was 
shown above to be c/>/2. Thus, 


A p 


o + »o 


0 = A p 

2 2/x 


(3.27) 


Rearranging terms we get the relationship between p, 
E and v: 


P = 


E 

2(1 + v) 


(3.28) 


3.2.43 The Lame constants 

The first line of Eq. (3.19) can be rewritten as 

Es xx = ( 1 + v )°~ xx ~ v ^xx + a yy + ( 3 - 29 ) 


and from Eq. (3.20) we have 


( a xx + CT yy + O ( 1 — 2v) y " xx ' °yy 


( e ^ + e vv +e zz) 


(1 — 2v) 


e 


(3.30) 


where 6 is the dilatation, as defined in Eq. (3.8). After sub¬ 
stituting Eq. (3.30) in Eq. (3.29) and rearranging we get 


__ yE _ „ E 

<Txx (1 + v) (1 - 2v) (1 +vf xx 


(3.31) 


Writing 


(l+v)(l-2v) 

and substituting from Eq. (3.28) we can write Eq. (3.31) 
in the simpler form 

<r xx = ke + 2 t xs xx (3.32) 

with similar expressions for and <x„. 

yy zz 

The constants A and p are known as the Lame const¬ 
ants. They are related to the elastic constants defined 
physically above, p is equivalent to the rigidity modulus, 
while the bulk modulus K, Young’s modulus E and 
Poissons ratio v can each be expressed in terms of both A 
and p (Box 3.1). 


32.4.4 Anisotropy 

The foregoing discussion treats the elastic parameters as 
constants. In fact they are dependent on pressure and 
temperature and so can only be considered constant for 
specified conditions. The variations of temperature and 
pressure in the Earth ensure that the elastic parameters 
vary with depth. Moreover, it has been assumed that the 
relationships between stress and strain hold equally for all 
directions, a property called isotropy. This condition is 
not fulfilled in many minerals. For example, if a mineral 
has uniaxial symmetry in the arrangement of the atoms in 
its unit cell, the physical properties of the mineral parallel 
and perpendicular to the axis of symmetry are different. 
The mineral is anisotropic. The relations between compo¬ 
nents of stress and strain in an anisotropic substance are 
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Fig. 3.8 (a) Unequal normal 
stresses <r„ and a,., in the x-y 

xx yy J -w 

plane, and their average value if 

p. (b) Stress differences Ap . 

between p and <j m and cr yy , i 

respectively, cause elongation i 

parallel toxand shortening ' 

parallel toy. (c) Forces f=ApA 1 

along the sides of the original 1 

square give shear forces f/^2 \ 

along the edges of the inner ] 

square, each of which has area ] 

A^2. (d) The shear stress on | 

each side of the inner square ! 

has value Ap and causes ! 

extension of the diagonal of . 

the inner square and shear 
deformation through an 
angle y>. 


°yy 



V = (°xx + %) / 2 


(a) 


y 


-Ap 



A p 


-Ap 


A P =°xx-V=V-%, 

- x 

(b) 




(c) 


(d) 


more complex than in the perfectly elastic, isotropic case 
examined in this chapter. The elastic parameters of an 
isotropic body are fully specified by the two parameters A 
and /x, but as many as 21 parameters may be needed to 
describe anisotropic elastic behavior. Seismic velocities, 
which depend on the elastic parameters, vary with direc¬ 
tion in an anisotropic medium. 

Normally, a rock contains so many minerals that it can 
be assumed that they are oriented at random and the rock 
can be treated as isotropic. This assumption can also be 
made, at least to first order, for large regions of the 
Earth’s interior. However, if anisotropic minerals are sub¬ 
jected to stress they develop a preferred alignment with 
the stress field. For example, platy minerals tend to align 
with their tabular shapes normal to the compression axis, 
or parallel to the direction of flow of a fluid. Preferential 
grain alignment results in seismic anisotropy. This has 
been observed in seismic studies of the upper mantle, 
especially at oceanic ridges, where anisotropic velocities 
have been attributed to the alignment of crystals by con¬ 
vection currents. 


3.2.5 Imperfect elasticity in the Earth 

A seismic wave passes through the Earth as an elastic dis¬ 
turbance of very short duration lasting only some seconds 
or minutes. Elasticity theory is used to explain seismic 
wave propagation. However, materials may react differ¬ 
ently to brief, sudden stress than they do to long-lasting 
steady stress. The stress response of rocks and minerals in 
the Earth is affected by various factors, including temper¬ 
ature, hydrostatic confining pressure, and time. As a result, 
elastic, anelastic and plastic behavior occur with various 
degrees of importance at different depths. 

Anelastic behavior in the Earth is related to the petro¬ 
physical properties of rocks and minerals. If a material is 
not perfectly elastic, a seismic wave passing through it 
loses energy to the material (e.g., as frictional heating) and 
the amplitude of the wave gradually diminishes. The 
decrease in amplitude is called attenuation, and it is due to 
anelastic damping of the vibration of particles of the 
material (see Section 3.3.2.7). For example, the passage of 
seismic waves through the asthenosphere is damped owing 
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Box 3.1: Elastic parameters in terms of the Lame constants 


1. Bulk modulus (K) 

The bulk modulus describes volumetric shape changes 


of a material under the effects of the normal stresses cr xx , 
a and <r zz . Writing Hooke’s law for each normal stress 

gives 


< 7 xx = X0 + 2 P B XX 

(la) 

a yy = X9 + 2 P B yy 

(lb) 

o- zz = M + 2 /aa: zz 

(lc) 


Adding equations (la), (lb), and (1 c) together gives 

+ a yy + O'zz = 3A6 > + 2 ^ S xx + B yy + S J ( 2 ) 


tr - = A (3A+V) +2 ^- (10) 

Gathering and rearranging terms gives the following 
succession: 



3A + 2/jl ] 


2 P b xx 


( A + /a ^ _ 
°'-«\3A + 2/xj 

/ 3A + 2/a\ 
^ \ A + /a 


(ID 

( 12 ) 

(13) 


The dilatation 6 is defined by Eq. (3.8) as 

(fix* + e W . + fizz) = 6 (3) 

For hydrostatic conditions we can write a xx = a = cr„ 
= -p and substitute in Eq. (2), which can now be 
rearranged in the form 


The definition of Young’s modulus is E — cr x Js xx and 
so in terms of the Lame constants 


E = 


/ 3A + 2/a\ 


(14) 


— 3 p = 3A 9 + 2/a0 


(4) 3. Poisson's ratio (v) 


Using the definition of the bulk modulus as K=-pl9, 
we get the result 

7C = A+|la (5) 

2. Young's modulus (E) 

Young’s modulus describes the longitudinal strains 
when a uniaxial normal stress is applied to a mater¬ 
ial. When only the longitudinal stress (T xx is applied 

(i.e., <r, =o\=0), Hooke’s law becomes 

yy zz 


Poisson’s ratio is defined as r = - e v J e xx ~~ e zJ e xx• 3t 
relates the bulk modulus K and Young’s modulus E as 
developed in Eq. (3.22): 


K = 


E 

3(1 -2v) 


(15) 


Substituting the expressions derived above for K and E 
we get 


3A + 2/a j / 3A + 2/a\ 
3 = 3(1 - 2v)^\ A +/a ) 


(16) 


<7 xx = X0 + 2 P IS xx ( 6a ) 

0 = \d + 2fj.e yy (6b) 

O=A0 + 2/Ae zz (6c) 


Adding equations (6a), (6b), and (6c) together gives 


a xx = 3A0 + 2 ^ B xx + B yy + fizz) = 3A0 + 2 ^° 

17 XX = < 3A + 2/^)0 

n - <Txx 

° (3A + 2/a) 


(7) 

(B) 

(9) 


This expression is now substituted in Eq. (6a), which 
becomes 


Rearranging terms leads to the expression for Poisson’s 
ratio v in terms of the Lame constants: 


A + fJL 


1 


M (l-2v) 


( 1 - 2*0 = 


A + fi 


2(A + /a) 


(17) 

(18) 
(19) 


The values of A and /a are almost equal in some 
materials, and it is possible to assume A = /a, from 
which it follows that *> = 0.25. Such a material is called a 
Poisson solid. 


to anelastic behavior at the grain level of the minerals. 
This may consist of time-dependent slippage between 
grains; alternatively, fluid phases may be present at the 
grain boundaries. 


A material that reacts elastically to a sudden stress 
may deform and flow plastically under a stress that acts 
over a long time interval. Plastic behavior in the asthenos- 
phere and in the deeper mantle may allow material to 
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flow, perhaps due to the motion of dislocations within 
crystal grains. The flow takes place over times on the 
order of hundreds of millions of years, but it provides 
an efficient means of transporting heat out of the deep 
interior. 

3.3 SEISMIC WAVES 

3.3.1 Introduction 

The propagation of a seismic disturbance through a het¬ 
erogeneous medium is extremely complex. In order to 
derive equations that describe the propagation ade¬ 
quately, it is necessary to make simplifying assumptions. 
The heterogeneity of the medium is often modelled by 
dividing it into parallel layers, in each of which homoge¬ 
neous conditions are assumed. By suitable choice of the 
thickness, density and elastic properties of each layer, the 
real conditions can be approximated. The most important 
assumption about the propagation of a seismic distur¬ 
bance is that it travels by elastic displacements in the 
medium. This condition certainly does not apply close to 
the seismic source. In or near an earthquake focus or the 
shot point of a controlled explosion the medium is 
destroyed. Particles of the medium are displaced perma¬ 
nently from their neighbors; the deformation is anelastic. 
However, when a seismic disturbance has travelled some 
distance away from its source, its amplitude decreases and 
the medium deforms elastically to permit its passage. The 
particles of the medium carry out simple harmonic 
motions, and the seismic energy is transmitted as a 
complex set of wave motions. 

When seismic energy is released suddenly at a point P 
near the surface of a homogeneous medium (Fig. 3.9), 
part of the energy propagates through the body of the 
medium as seismic body waves. The remaining part of 
the seismic energy spreads out over the surface as a 
seismic surface wave, analogous to the ripples on the 
surface of a pool of water into which a stone has been 
thrown. 

3.3.2 Seismic body waves 

When a body wave reaches a distance r from its source in 
a homogeneous medium, the wavefront (defined as the 
surface in which all particles vibrate with the same phase) 
has a spherical shape, and the wave is called a spherical 
wave. As the distance from the source increases, the 
curvature of the spherical wavefront decreases. At great 
distances from the source the wavefront is so flat that it 
can be considered to be a plane and the seismic wave is 
called a plane wave. The direction perpendicular to the 
wavefront is called the seismic ray path. The description 
of the harmonic motion in plane waves is simpler than for 
spherical waves, because for plane waves we can use 
orthogonal Cartesian coordinates. Even for plane waves 
the mathematical description of the three-dimensional 



Fig. 3.9 Propagation of a seismic disturbance from a point source P 
near the surface of a homogeneous medium; the disturbance travels as 
a body wave through the medium and as a surface wave along the free 
surface. 

displacements of the medium is fairly complex. However, 
we can learn quite a lot about body-wave propagation 
from a simpler, less rigorous description. 

33.2.1 Compressions! waves 

Let Cartesian reference axes be defined such that the 
x-axis is parallel to the direction of propagation of the 
plane wave; the y- and z-axes then lie in the plane of 
the wavefront (Fig. 3.10). A generalized vibration of the 
medium can be reduced to components parallel to each of 
the reference axes. In the x-direction the particle motion 
is back and forward parallel to the direction of propaga¬ 
tion. This results in the medium being alternately 
stretched and condensed in this direction (Fig. 3.11a). 
This harmonic motion produces a body wave that is 
transmitted as a sequence of rarefactions and condensa¬ 
tions parallel to the x-axis. 

Consider the disturbance of the medium shown in 
Fig. 3.11b. The area of the wavefront normal to the 
x-direction is A x , and the wave propagation is treated as 
one-dimensional. At an arbitrary position x (Fig. 3.11c), 
the passage of the wave produces a displacement u and a 
force F x in the x-direction. At the position x + dx the dis¬ 
placement is u + d u and the force is F x + d F x . Here dx is 
the infinitesimal length of a small volume element which 
has mass p dx A x . The net force acting on this element in 
the x-direction is given by 

(F x + dF x )-F x = dF x = d .^dx (3.33) 

The force F x is caused by the stress element a xx acting 
on the area A x , and is equal to <t xx A x . This allows us to 
write the one-dimensional equation of motion 

(l ,dxA x )f t 2 = dx/, x d gx ( 3 - 34 ) 
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Fig. 3.10 Representation of a generalized vibration as components 
parallel to three orthogonal reference axes. Particle motion in thex- 
direction is back and forth parallel to the direction of propagation, 
corresponding to the P-wave. Vibrations along they- and z-axes are in 
the plane of the wavefront and normal to the direction of propagation. 
Thez-vibration in a vertical plane corresponds to the SV-wave; they- 
vibration is horizontal and corresponds to the SH-wave. 

(a) ooooooo o o o oooooooo o o o ooooooo 

C R C R C 


(b) 


(c) 



x x + dx 


Fig. 3.11 (a) The particle motion in a one-dimensional P-wave 
transmits energy as a sequence of rarefactions (R) and condensations 
(C) parallel to the x-axis. (b) Within the wavefront the component of 
force F x in thex-direction of propagation is distributed over an 
element of area A x normal to the x-axis. (c) A particle at position 
xexperiences a longitudinal displacement u in thex-direction, 
while at the nearby position x+ dx the corresponding displacement 
is u +d u. 


The definitions of Young’s modulus, E, (Eq. (3.15)) 
and the normal strain e xx (Eq. (3.4)) give, for a one¬ 
dimensional deformation 


°Ty = Ee xx 


j7 du 
E dx 


(3.35) 


Substitution of Eq. (3.35) into Eq. (3.34) gives the one¬ 
dimensional wave equation 


d 2 u _ v i d 2 u 
dt 2 dx 2 


(3.36) 


where V is the velocity of the wave, given by 



A one-dimensional wave is rather restrictive. It repre¬ 
sents the stretching and compressing in the x-direction as 
effects that are independent of what happens in the y- and 
z-directions. In an elastic solid the elastic strains in 
any direction are coupled to the strains in transverse direc¬ 
tions by Poisson's ratio for the medium. A three-dimen¬ 
sional analysis is given in Appendix A that takes into 
account the simultaneous changes perpendicular to the 
direction of propagation. In this case the area A x can no 
longer be considered constant. Instead of looking at the 
displacements in one direction only, all three axes must be 
taken into account. This is achieved by analyzing the 
changes in volume. The longitudinal (or compressional) 
body wave passes through a medium as a series of dilata¬ 
tions and compressions. The equation of the compressional 
wave in the x-direction is 


d 2 0 _ ? d 2 0 
dt 2 “ _ 3x 2 


(3.38) 


where a is the wave velocity and is given by 


The longitudinal wave is the fastest of all seismic 
waves. When an earthquake occurs, this wave is the first to 
arrive at a recording station. As a result it is called the 
primary wave, or P-wave. Eq. (3.39) shows that P-waves 
can travel through solids, liquids and gases, all of which 
are compressible (3f#0). Liquids and gases do not allow 
shear. Consequently, yu = 0, and the compressional wave 
velocity in a liquid or gas is given by 

(3.40) 



33.2.2 Transverse waves 

The vibrations along they- and z-axes (Fig. 3.10) are par¬ 
allel to the wavefront and transverse to the direction of 
propagation. If we wish, we can combine the y- and z- 
components into a single transverse motion. It is more 
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(b) 



Fig. 3.12 (a) Shear distortion caused by the passage of a 
one-dimensional S-wave. (b) Displacements and forces in the 
z-direction at the positions x and x + dx bounding a small 
sheared element. 

convenient, however, to analyze the motions in the vertical 
and horizontal planes separately. Here we discuss the dis¬ 
turbance in the vertical plane defined by the x- and z-axes; 
an analogous description applies to the horizontal plane. 

The transverse wave motion is akin to that seen when a 
rope is shaken. Vertical planes move up and down and 
adjacent elements of the medium experience shape distor¬ 
tions (Fig. 3.12a), changing repeatedly from a rectangle to 
a parallelogram and back. Adjacent elements of the 
medium suffer vertical shear. 

Consider the distortion of an element bounded by ver¬ 
tical planes separated by a small horizontal distance dx 
(Fig. 3.12b) at an arbitrary horizontal position x. The 
passage of a wave in the x-direction produces a displace¬ 
ment w and a force F_ in the z-direction. At the position 
x + dx the displacement is w + dvr and the force is F_ + 
d F_. The mass of the small volume element bounded by 
the vertical planes is p d xA x , where A x is the area of the 
bounding plane. The net force acting on this element in 
the z-direction is given by 

(F z + dF z )-F z = dF : = d .^dx (3.41) 

The force F, arises from the shear stress cr x , on the area 
A x , and is equal to cr x .A x . The equation of motion of the 
vertically sheared element is 

(i P dxA x = dxA x ( 3 - 42 > 


We now have to modify the Lame expression for 
Hooke’s law and the definition of shear strain so that they 
apply to the passage of a one-dimensional shear wave in 
the x-direction. In this case, because the areas of the par¬ 
allelograms between adjacent vertical planes are equal, 
there is no volume change. The dilatation 6 is zero, and 
Hooke’s law is as given in Eq. (3.16): 

<T xz = 2 F' e xz (3.43) 

Following the definition of shear-strain components 
in Eq. (3.12) we have 


_1 f dw . du \ 
e xz 2 \dx dz) 


(3.44) 


For a one-dimensional shear wave there is no change 
in the distance dx between the vertical planes; du and 
du/dz are zero and s x _ is equal to (dw/dx)l2. On substitu¬ 
tion into Eq. (3.43) this gives 


dw 

,r ^ = l x 3x 


(3.45) 


and on further substitution into Eq. (3.42) and rearrange¬ 
ment of terms we get 


d 2 w nl d 2 w 


(3.46) 


where /3 is the velocity of the shear wave, given by 
/3 = y? (3-47) 

The only elastic property that determines the velocity of 
the shear wave is the rigidity or shear modulus, p. In liquids 
and gases p is zero and shear waves cannot propagate. In 
solids, a quick comparison of Eqs. (3.39) and (3.47) gives 



(3.48) 


By definition, the bulk modulus K is positive (if it were 
negative, an increase in confining pressure would cause an 
increase in volume), and therefore a is always greater than 
ft. Shear waves from an earthquake travel more slowly 
than P-waves and are recorded at an observation station 
as later arrivals. Shear waves are often referred to as sec¬ 
ondary waves or S-waves. 

The general shear-wave motion within the plane of the 
wavefront can be resolved into two orthogonal compo¬ 
nents, one being horizontal and the other lying in the ver¬ 
tical plane containing the ray path (Fig. 3.10). Equation 
(3.46) describes a one-dimensional shear wave which 
travels in the x-direction, but which has particle displace¬ 
ments (vv) in the z-direction. This wave can be considered 
to be polarized in the vertical plane. It is called the SV- 
wave. A similar equation describes the shear wave in 
the x-direction with particle displacements (v) in the y- 
direction. A shear wave that is polarized in the horizontal 
plane is called an SH-wave. 
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As for the description of longitudinal waves, this treat¬ 
ment of shear-wave transmission is over-simplified; a more 
rigorous treatment is given in Appendix A. The passage of 
a shear wave involves rotations of volume elements within 
the plane normal to the ray path, without changing their 
volume. For this reason, shear waves are also sometimes 
called rotational (or equivoluminal) waves. The rotation is 
a vector, v[/, with x-, y- and z-components given by 


. _dw_3y . _du_dw . _ dv du 
^ ~ dy dz % ““ dz dx ^ ~ dx dy 


(3.49) 


A more appropriate equation for the shear wave in the 
x-direction is then 


d 2 4* 2 d 2 ^ 

dt 2 1 dx 2 


(3.50) 


where /3 is again the shear-wave velocity as given by Eq. 
(3.47). 

Until now we have chosen the direction of propaga¬ 
tion along one of the reference axes so as to simplify the 
mathematics. If we remove this restriction, additional 
second-order differentiations with respect to the y- and z- 
coordinates must be introduced. The P-wave and S-wave 
equations become, respectively, 


#0 = 2 (P0 M M) 

dt 2 \3x 2 dy 2 dx 2 ) 


(3.51) 


dt 2 \dx 2 dy 2 dx 2 ) 


(3.52) 


medium. The wave number ( k ), angular frequency ( co) and 
velocity (c) are defined and related by 

k = = c = kf= f (3.54) 

Equation (3.53) for the displacement (u) can then be 
written 

u = A sin (kx — cot) = A s\nk(x — ct) (3.55) 

The velocity c introduced here is called the phase veloc¬ 
ity. It is the velocity with which a constant phase (e.g., the 
"peak’' or “trough,” or one of the zero displacements) is 
transmitted. This can be seen by equating the phase to a 
constant and then differentiating the expression with 
respect to time, as follows: 

kx - cot = constant 
ci x 

k^~co = 0 (3.56) 

dx _ co _ 
dt k c 

To demonstrate that the displacement given by Eq. 
(3.55) is a solution of the one-dimensional wave equation 
(Eq. (3.38)) we must partially differentiate it in Eq. (3.55) 
twice with respect to time (t) and twice with respect to 
position (x): 

= Akcos(kx — cot) 

= — Ak 2 sin(kx — cot) = — k 2 u 


33.2.3 The solution of the seismic wave equation 

Two important characteristics of a wave motion are: (1) it 
transmits energy by means of elastic displacements of the 
particles of the medium, i.e., there is no net transfer of 
mass, and ( 2 ) the wave pattern repeats itself in both time 
and space. The harmonic repetition allows us to express 
the amplitude variation by a sine or cosine function. As the 
wave passes any point, the amplitude of the disturbance is 
repeated at regular time intervals, T, the period of the 
wave. The number of times the amplitude is repeated per 
second is the frequency, f. which is equal to the inverse of 
the period (/= 1 IT). At any instant in time, the distur¬ 
bance in the medium is repeated along the direction of 
travel at regular distances, A, the wavelength of the wave. 
During the passage of a P-wave in the x-direction, the har¬ 
monic displacement (;/) of a particle from its mean posi¬ 
tion can be written 

u = ^4sin2TT^ — (3.53) 

where A is the amplitude. 

The quantity in brackets is called the phase of the 
wave. Any value of the phase corresponds to a particular 
amplitude and direction of motion of the particles of the 


= — Acocosfkx - cot) (3.57) 

= — Aofsin) kx — cot) = — co 2 u 

d 2 u _ co 2 d 2 u _ -> d 2 u 
dt 2 k 2 dx 2 C dx 2 

For a P-wave travelling along the x-axis the dilatation 
0 is given by an equation similar to Eq. (3.57), with substi¬ 
tution of the P-wave velocity (a) for the velocity c. 
Similarly, for an S-wave along the x-axis the rotation i|/ is 
given by an equation like Eq. (3.57) with appropriate sub¬ 
stitutions of i|/ for u and the S-wave velocity (jS) for the 
velocity c. However, in general, the solutions of the three- 
dimensional compressional and shear wave equations 
(Eqs. (3.51) and (3.52), respectively) are considerably 
more complicated than those given by Eq. (3.55). 

33.2.4 D'Alembert's principle 

Equation (3.55) describes the particle displacement during 
the passage of a wave that is travelling in the direction of 
the positive x-axis with velocity c. Because the velocity 
enters the wave equation as c 2 , the one-dimensional wave 
equation is also satisfied by the displacement 

u = 7?sin/:(x + ct) 


(3.58) 
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which corresponds to a wave travelling with velocity c in 
the direction of the negative x-axis. 

In fact, any function of (x ± ct) that is itself continu¬ 
ous and that has continuous first and second derivatives is 
a solution of the one-dimensional wave equation. This is 
known ns D’Alembert’s principle. It can be simply demon¬ 
strated for the function F= fix - ct) = f(4>) as follows: 

dF = dF^ = dF a F = dFd± = _ dF 

dX d(p dx dtp dt dtp dt C dtp 

d 2 F = d dF _ d<P d i)F = d 2 F 
dx 2 dx dx dx dtp dx dtp 2 


dfF = jfdF = d$j)_dF = _ d_(_ dF\ = 2 d 2 F 

dt 2 dt dt dt dtp dt C dtp\ C dtp) C dtp 2 


df\F _ 2 dzF 
dt 2 dx 2 


(3.59) 


Because Eq. (3.59) is valid for positive and negative 
values of c, its general solution F represents the superpo¬ 
sition of waves travelling in opposite directions along the 
x-axis, and is given by 


value of t a constant phase of the wave equation solution 
given by Eq. (3.61) requires that 

lx + my + nz = constant (3.64) 

From analytical geometry we know that Eq. (3.64) 
represents a family of planes perpendicular to a line with 
direction cosines (/, in, n). We began this discussion by 
describing a wave moving with velocity c along the direc¬ 
tion x', and now we see that this direction is normal to the 
plane wavefronts. This is the direction that we defined 
earlier as the ray path of the wave. 

In a medium like the Earth the elastic properties 
and density - and therefore also the velocity - vary 
with position. The ray path is no longer a straight line 
and the wavefronts are not planar. Instead of Eq. (3.61) 
we write 

F=f[S(x,y,z)-c 0 t\ (3.65) 

where S(x, y, z) is a function of position only and c Q is a 
constant reference velocity. Substitution of Eq. (3.65) 
into Eq. (3.63) gives 


F = f(x — ct) + g(x + ct) (3.60) 

33.2.5 The eikonal equation 

Consider a wave travelling with constant velocity c along 
the axis x' which has direction cosines (/, m, n). If x' is 
measured from the center of the coordinate axes (x, y, z) 
we can substitute x' = lx + my + nz for x in Eq. (3.60). If 
we consider for convenience only the wave travelling in 
the direction of + x', we get as the general solution to the 
wave equation 



(3.66) 


where £ is known as the refractive index of the medium. 
Equation (3.66) is called the eikonal equation. It estab¬ 
lishes the equivalence of treating seismic wave propaga¬ 
tion by describing the wavefronts or the ray paths. The 
surfaces S(x, y, z) = constant represent the wavefronts (no 
longer planar). The direction cosines of the ray path 
(normal to the wavefront) are in this case given by 


F=f(lx + my + nz — ct) 


(3.61) A=£§f M = v = cf z (3.67) 


The wave equation is a second-order differential equa¬ 
tion. However, the function F is also a solution of a first- 
order differential equation. This is seen by differentiating 
F with respect to x, y, z, and t, respectively, which gives 


dF = dFd± = ] dF 
dx dtp dx dtp 

dF = dF<^ = SIF 
dz dtp dz n d(f> 


dF = dFd<h = d£ 
dy dtp dy m dtp 

c lF = dF^h = _ SIF 
dt dtp dt C dtp 


(3.62) 


33.2.6 The energy in a seismic disturbance 

It is important to distinguish between the velocity with 
which a seismic disturbance travels through a material 
and the speed with which the particles of the material 
vibrate during the passage of the wave. The vibrational 
speed (v ) is obtained by differentiating Eq. (3.55) with 
respect to time, which yields 


The direction cosines (/, m, n) are related by P + m 2 + 
n 2 = 1 , and so, as can be verified by substitution, the 
expressions in Eq. (3.62) satisfy the equation 






(3.63) 


In seismic wave theory, the progress of a wave is 
described by successive positions of its wavefront, defined 
as the surface in which all particles at a given instant in 
time are moving with the same phase. For a particular 


v p = — — coAcos (kx — rot) (3.68) 

The intensity or energy density of a wave is the energy 
per unit volume in the wavefront and consists of kinetic 
and potential energy. The kinetic part is given by 

I = \pv 2 = \pur-A 2 cos 2 {kx — cot) (3.69) 

The energy density averaged over a complete har¬ 
monic cycle consists of equal parts of kinetic and poten¬ 
tial energy; it is given by 
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I,, = Wa 2 (3.70) 

i.e., the mean intensity of the wave is proportional to the 
square of its amplitude. 


The damping of seismic waves is described by a para¬ 
meter called the quality factor ( 2 ), a concept borrowed 
from electric circuit theory where it describes the perfor¬ 
mance of an oscillatory circuit. It is defined as the frac¬ 
tional loss of energy per cycle 


33.2.7 Attenuation of seismic waves 

The further a seismic signal travels from its source the 
weaker it becomes. The decrease of amplitude with 
increasing distance from the source is referred to as atten¬ 
uation. It is partly due to the geometry of propagation of 
seismic waves, and partly due to anelastic properties of 
the material through which they travel. 

The most important reduction is due to geometric 
attenuation. Consider the seismic body waves generated 
by a seismic source at a point P on the surface of a 
uniform half-space (see Fig. 3.9). If there is no energy loss 
due to friction, the energy (E h ) in the wavefront at dis¬ 
tance r from its source is distributed over the surface of a 
hemisphere with area 2'nr 2 . The intensity (or energy 
density, 7 b ) of the body waves is the energy per unit area of 
the wavefront, and at distance r is: 


The surface wave is constricted to spread out laterally. 
The disturbance affects not only the free surface but 
extends downwards into the medium to a depth d, which 
we can consider to be constant for a given wave (Fig. 3.9). 
When the wavefront of a surface wave reaches a distance r 
from the source, the initial energy (Ej is distributed over a 
circular cylindrical surface with area Ittrd. At a distance r 
from its source the intensity of the surface wave is given by: 


4('3 


E s 

I'nrd 


(3.72) 


These equations show that the decrease in intensity of 
body waves is proportional to Hr 2 while the decrease in 
surface wave intensity is proportional to Hr. As shown in 
Eq. (3.70), the intensity of a wave-form, or harmonic 
vibration, is proportional to the square of its amplitude. 
The corresponding amplitude attenuations of body waves 
and surface waves are proportional to Mr and 1 /Vr, 
respectively. Thus, seismic body waves are attenuated 
more rapidly than surface waves with increasing distance 
from the source. This explains why, except for the records 
of very deep earthquakes that do not generate strong 
surface waves, the surface-wave train on a seismogram is 
more prominent than that of the body waves. 

Another reason for attenuation is the absorption of 
energy due to imperfect elastic properties. If the particles 
of a medium do not react perfectly elastically with their 
neighbors, part of the energy in the wave is lost (reappear¬ 
ing, for example, as frictional heat) instead of being trans¬ 
ferred through the medium. This type of attenuation of 
the seismic wave is referred to as anelastic damping. 


2ir _ _ AE 

Q~ E 


(3.73) 


In this expression A E is the energy lost in one cycle and 
E is the total elastic energy stored in the wave. If we con¬ 
sider the damping of a seismic wave as a function of the 
distance that it travels, a cycle is represented by the wave¬ 
length (A) of the wave. Equation (3.73) can be rewritten 
for this case as 


2tt_ 

Q E dr 

dE _ _ 2rrd r 
E Q A 


(3.74) 


It is conventional to measure damping by its effect on 
the amplitude of a seismic signal, because that is what is 
observed on a seismic record. We have seen that the energy 
in a wave is proportional to the square of its amplitude A 
(Eq. (3.70)). Thus we can write dEIE=2dAIA in 
Eq. (3.74), and on solving we get the damped amplitude of 
a seismic wave at distance r from its source: 


A = A 0 ex P (—^f)=4)exp(-2j) (3-75) 

In this equation D is the distance within which the 
amplitude falls to 1/e (36.8%, or roughly a third) of its 
original value. The inverse of this distance (D _1 ) is called 
the absorption coefficient. For a given wavelength, D is pro¬ 
portional to the Q-factor of the region through which the 
wave travels. A rock with a high 2‘factor transmits a 
seismic wave with relatively little energy loss by absorption, 
and the distance D is large. For body waves D is generally 
of the order of 10,000 km and damping of the waves by 
absorption is not a very strong effect. It is slightly stronger 
for seismic surface waves, for which D is around 5000 km. 

Equation (3.75) shows that the damping of a seismic 
wave is dependent on the 2 ‘factor of the region of the 
Earth that the wave has travelled through. In general the 
2-factor for P-waves is higher than the 2‘factor for 
S-waves. This may indicate that anelastic damping is deter¬ 
mined primarily by the shear component of strain. In 
solids with low rigidity, the shear strain can reach high 
levels and the damping is greater than in materials with 
high rigidity. In fluids the 2‘factor is high and damping is 
low, because shear strains are zero and the seismic wave is 
purely compressional. The values of Q are quite variable in 
the Earth: values of around 10 2 are found for the mantle, 
and around 10 3 for P-waves in the liquid core. Because Q is 
a measure of the deviation from perfect elasticity, it is also 
encountered in the theory of natural oscillations of the 
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Earth, and has an effect on fluctuations of the Earth’s free 
rotation, as in the damping of the Chandler wobble. 

It follows from Eq. (3.75) that the absorption 
coefficient (Z) -1 ) is inversely proportional to the wave¬ 
length A. Thus the attenuation of a seismic wave by 
absorption is dependent upon the frequency of the signal. 
High frequencies are attenuated more rapidly than are 
lower frequencies. As a result, the frequency spectrum of 
a seismic signal changes as it travels through the ground. 
Although the original signal may be a sharp pulse (result¬ 
ing from a shock or explosion), the preferential loss of 
high frequencies as it travels away from the source causes 
the signal to assume a smoother shape. This selective loss 
of high frequencies by absorption is analogous to remov¬ 
ing high frequencies from a sound source using a filter. 
Because the low frequencies are not affected so markedly, 
they pass through the ground with less attenuation. The 
ground acts as a low-pass filter to seismic signals. 

3.3.3 Seismic surface waves 

A disturbance at the free surface of a medium propagates 
away from its source partly as seismic surface waves. Just 
as seismic body waves can be classified as P- or S-waves, 
there are two categories of seismic surface waves, some¬ 
times known collectively as L-waves (Section 3.4.4.3), and 
subdivided into Rayleigh waves (L R ) and Love waves (Lq), 
which are distinguished from each other by the types of 
particle motion in their wavefronts. In the description of 
body waves, the motion of particles in the wavefront was 
resolved into three orthogonal components - a longitudi¬ 
nal vibration parallel to the ray path (the P-wave motion), 
a transverse vibration in the vertical plane containing the 
ray path (the vertical shear or SV-wave) and a horizontal 
transverse vibration (the horizontal shear or SH-wave). 
These components of motion, restricted to surface layers, 
also determine the particle motion and character of the 
two types of surface waves. 

3.3.3.1 Rayleigh waves (L R ) 

In 1885 Lord Rayleigh described the propagation of a 
surface wave along the free surface of a semi-infinite elastic 
half-space. The particles in the wavefront of the Rayleigh 
wave are polarized to vibrate in the vertical plane. The 
resulting particle motion can be regarded as a combination 
of the P- and SV-vibrations. If the direction of propaga¬ 
tion of the Rayleigh wave is to the right of the viewer (as in 
Fig. 3.13), the particle motion describes a retrograde ellipse 
in the vertical plane with its major axis vertical and minor 
axis in the direction of wave propagation. If Poisson’s rela¬ 
tion holds for a solid (i.e., Poisson’s ratio v = 0.25) the 
t heory of R ayleigh waves gives a speed (F LR ) equal to 
V(2 — 2/V3) = 0.9194 of the speed (J3) of S-waves (i.e., V LR 
= 0.9194/3). This is approximately the case in the Earth. 

The particle displacement is not confined entirely to 
the surface of the medium. Particles below the free 


Rayleigh wave (L R ) 



Fig. 3.13 The particle motion in the wavefront of a Rayleigh wave 
consists of a combination of P- and SV-vibrations in the vertical plane. 
The particles move in retrograde sense around an ellipse that has its 
major axis vertical and minor axis in the direction of wave propagation. 

surface are also affected by the passage of the Rayleigh 
wave; in a uniform half-space the amplitude of the parti¬ 
cle displacement decreases exponentially with increasing 
depth. The penetration depth of the surface wave is typi¬ 
cally taken to be the depth at which the amplitude is 
attenuated to (e _1 ) of its value at the surface. For 
Rayleigh waves with wavelength A the characteristic pene¬ 
tration depth is about 0.4A. 

3.3.32 Love waves (L q ) 

The boundary conditions which govern the components 
of stress at the free surface of a semi-infinite elastic half¬ 
space prohibit the propagation of SH-waves along the 
surface. However, A. E. H. Love showed in 1911 that if a 
horizontal layer lies between the free surface and the 
semi-infinite half-space (Fig. 3.14a), SH-waves within the 
layer that are reflected at supercritical angles (see Section 
3.6) from the top and bottom of the layer can interfere 
constructively to give a surface wave with horizontal par¬ 
ticle motions (Fig. 3.14b). The velocity (y3j) of S-waves in 
the near-surface layer must be lower than in the underly¬ 
ing half-space Q3.,). The velocity of the Love waves (F LQ ) 
lies between the two extreme values: jS 1 <F LQ <j3 2 . 

Theory shows that the speed of Love waves with very 
short wavelengths is close to the slower velocity (3 X of the 
upper layer, while long wavelengths travel at a speed close 
to the faster velocity /L of the lower medium. This depen¬ 
dence of velocity on wavelength is termed dispersion. 
Love waves are always dispersive, because they can only 
propagate in a velocity-layered medium. 

3.33.3 The dispersion of surface waves 

The dispersion of surface waves provides an important 
tool for determining the vertical velocity structure of the 
lower crust and upper mantle. Love waves are intrinsically 
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Fig. 3.14 In a Love wave the particle motion is horizontal and 
perpendicular to the direction of propagation. The amplitude of the 
wave decreases with depth below the free surface. 

dispersive even when the surface layer and underlying half¬ 
space are uniform. Rayleigh waves over a uniform half¬ 
space are non-dispersive. However, horizontal layers with 
different velocities are usually present or there is a vertical 
velocity gradient. Rayleigh waves with long wavelengths 
penetrate more deeply into the Earth than those with short 
wavelengths. The speed of Rayleigh waves is proportional 
to the shear-wave velocity (F LR ~ 0.92/3), and in the crust 
and uppermost mantle /3 generally increases with depth. 
Thus, the deeper penetrating long wavelengths travel with 
faster seismic velocities than the short wavelengths. As a 
result, the Rayleigh waves are dispersive. 

The packet of energy that propagates as a surface wave 
contains a spectrum of wavelengths. The energy in the 
wave propagates as the envelope of the wave packet (Fig. 
3.15a), at a speed that is called the group velocity (U). The 
individual waves that make up the wave packet travel with 
phase velocity (c), as defined in Eq. (3.56). If the phase 
velocity is dependent on the wavelength, the group veloc¬ 
ity is related to it by 

t/= t = i ^ = c + 4 = c -4 tt-76) 

The situation in which phase velocity increases with 
increasing wavelength (i.e., the longer wavelengths propa¬ 
gate faster than the short wavelengths) is called normal 
dispersion. In this case, because held A is positive, the 
group velocity U is slower than the phase velocity c. The 
shape of the wave packet changes systematically as the 
faster moving long wavelengths pass through the packet 
(Fig. 3.15b). As time elapses, an initially concentrated 
pulse becomes progressively stretched out into a long 
train of waves. Consequently, over a medium in which 
velocity increases with depth, the long wavelengths arrive 
as the first part of the surface-wave record at large dis¬ 
tances from the seismic source. 
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Fig. 3.15 (a) The surface-wave energy propagates as the envelope of 
the wave packet with the group velocity U, while the individual 
wavelengths travel with the phase velocity c. (b) Change of shape of a 
wave packet due to normal dispersion as the faster-moving long 
wavelengths pass through the packet. At large distances from the 
source, the long wavelengths arrive as the first part of the surface- 
wave train (modified from Telford ef a/., 1990). 

3.3.4 Free oscillations of the Earth 

When a bell is struck with a hammer, it vibrates freely at a 
number of natural frequencies. The combination of 
natural oscillations that are excited gives each bell its par¬ 
ticular sonority. In an analogous way, the sudden release 
of energy in a very large earthquake can set the entire 
Earth into vibration, with natural frequencies of oscilla¬ 
tion that are determined by the elastic properties and 
structure of the Earth’s interior. The free oscillations 
involve three-dimensional deformation of the Earth’s 
spherical shape and can be quite complex. Before dis¬ 
cussing the Earth’s free oscillations it is worth reviewing 
some concepts of vibrating systems that can be learned 
from the one-dimensional excitation of a vibrating string 
that is fixed at both ends. 

Any complicated vibration of the string can be rep¬ 
resented by the superposition of a number of simpler 
vibrations, called the normal modes of vibration. These 
arise when travelling waves reflected from the boundaries 
at the ends of the string interfere with each other to give a 
standing wave. Each normal mode corresponds to a 
standing wave with frequency and wavelength determined 
by the condition that the length of the string must always 
equal an integral number of half-wavelengths (Fig. 3.16). 
As well as the fixed ends, there are other points on the 
string that have zero displacement; these are called the 
nodes of the vibration. The first normal (or fundamental) 







138 Seismology and the internal structure of the Earth 


(a) fundamental 



(c) second overtone 



Fig. 3.16 Normal modes of vibration for a standing wave on a string 
fixed at both ends. 


mode of vibration has no nodes. The second normal 
mode (sometimes called the first overtone) has one node; 
its wavelength and period are half those of the fundamen¬ 
tal mode. The third normal mode (second overtone) has 
three times the frequency of the first mode, and so on. 
Modes with one or more node are called higher-order 
modes. 

The concepts of modes and nodes are also applicable 
to a vibrating sphere. The complex general vibration of a 
sphere can be resolved into the superposition of a number 
of normal modes. The nodes of zero displacement 
become nodal surfaces on which the amplitude of the 
vibration is zero. The free oscillations of the Earth can be 
divided into three categories. In radial oscillations the dis¬ 
placements are purely radial, in spheroidal oscillations 
they are partly radial and partly tangential, and in 
toroidal oscillations they are purely tangential. 

3.3.4.1 Radial oscillations 

The simplest kind of free oscillations are the radial oscil¬ 
lations, in which the shape of the Earth remains ‘'spheri¬ 
cal” and all particles vibrate purely radially (Fig. 3.17a). 
In the fundamental mode of this type of oscillation the 
entire Earth expands and contracts in unison with a 
period of about 20.5 minutes. The second normal 
mode (first overtone) of radial oscillations has a single 
internal spherical nodal surface. While the inner sphere is 
contracting, the part outside the nodal surface is expand¬ 
ing, and vice versa. The nodal surfaces of higher modes 
are also spheres internal to the Earth and concentric with 
the outer surface. 

3.3.42 Spheroidal oscillations 

A general spheroidal oscillation involves both radial and 
tangential displacements that can be described by spheri¬ 


cal harmonic functions (see Box 2.3). These functions 
are referred to an axis through the Earth at the point of 
interest (e.g., an earthquake epicenter), and to a great 
circle which contains the axis. With respect to this refer¬ 
ence frame they describe the latitudinal and longitudinal 
variations of the displacement of a surface from a sphere. 
They allow complete mathematical description and 
concise identification of each mode of oscillation with the 
aid of three indices. The longitudinal order m is the number 
of nodal lines on the sphere that are great circles, the order 
l is determined from the (/- m) latitudinal nodal lines, and 
the overtone number n describes the number of internal 
nodal surfaces. The notation n S™ denotes a spheroidal 
oscillation of order /, longitudinal order m, and overtone 
number n. In practice, only oscillations with longitudinal 
order m = 0 (rotationally symmetric about the reference 
axis) are observed, and this index is usually dropped. Also, 
the oscillation of order /= 1 does not exist; it would have 
only a single equatorial nodal plane and the vibration 
would involve displacement of the center of gravity. The 
spheroidal oscillations 0 S 2 and 0 S 3 are shown in Fig. 3.17b. 
Spheroidal oscillations displace the Earth’s surface and 
alter the internal density distribution. After large earth¬ 
quakes they produce records on highly sensitive gravity 
meters used for bodily Earth-tide observations, and also on 
strain gauges and tilt meters. 

The radial oscillations can be regarded as a special type 
of spheroidal oscillation with 1 = 0. The fundamental 
radial oscillation is also the fundamental spheroidal oscil¬ 
lation, denoted 0 S 0 ; the next higher mode is called jS 0 
(Fig. 3.17a). 

33.4.3 Toroidal oscillations 

The third category of oscillation is characterized by dis¬ 
placements that are purely tangential. The spherical 
shape and volume of the Earth are unaffected by a 
toroidal oscillation, which involves only longitudinal dis¬ 
placements about an axis (Fig. 3.18a). The amplitude of 
the longitudinal displacement varies with latitude. (Note 
that, as for spheroidal oscillations, “latitude” and “longi¬ 
tude” refer to the symmetry axis and are different from 
their geographic definitions.) The toroidal modes have 
nodal planes that intersect the surface on circles of “lati¬ 
tude,” on which the toroidal displacement is zero. 
Analogously to the spheroidal oscillations, the notation 
n T ; is used to describe the spatial geometry of toroidal 
modes. The mode 0 T 0 has zero displacement and 0 Tj does 
not exist, because it describes a constant azimuthal twist 
of the entire Earth, which would change its angular 
momentum. The simplest toroidal mode is 0 T„ in which 
two hemispheres oscillate in opposite senses across a 
single nodal plane (Fig. 3.18a). Higher toroidal modes of 
order / oscillate across (/ - 1) nodal planes perpendicular 
to the symmetry axis. 

The amplitudes of toroidal oscillations inside the 
Earth change with depth. The displacements of internal 
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Fig. 3.17 (a) Modes of radial 
oscillation with their periods: 
in the fundamental mode 0 S 0 
(also called the balloon mode) 
the entire Earth expands and 
contracts in unison; higher 
modes, such as jSq, have 
internal spherical nodal 
surfaces concentric with the 
outer surface, (b) Modes of 
spheroidal oscillation 0 S 2 
("football mode") and 0 S 3 
with their periods; the nodal 
lines are small circles 
perpendicular to the 
symmetry axis. 


(a) radial 
oscillations 



(b) spheroidal 
oscillations 
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Fig. 3.18 (a) The modes of toroidal oscillation 0 T 2 and 0 T 3 with their 
periods; these modes involve oscillation in opposite senses across nodal 
planes normal to the symmetry axis, (b) In toroidal oscillations, 
displacements of internal spherical surfaces from their equilibrium 
positions vary with depth and are zero at internal nodal surfaces. 

spherical surfaces from their equilibrium positions 
are zero at internal nodal surfaces (Fig. 3.18b). The 
nomenclature for toroidal oscillations also represents 
these internal nodal surfaces. Thus, t T ; denotes a general 
toroidal mode with one internal nodal surface; the inter¬ 
nal sphere twists in an opposite sense to the outer spheri¬ 
cal shell; ,T, has two internal nodal surfaces, etc. (Fig. 
3.18b). 

The twisting motion in toroidal oscillations does not 
alter the radial density distribution in the Earth, and so 
they do not show in gravity meter records. They cause 
changes in strain and displacement parallel to the Earth’s 
surface and can be recorded by strain meters. Toroidal 
oscillations are dependent on the shear strength of the 
Earth’s interior. The Earth’s fluid core cannot take part in 


these oscillations, and they are therefore restricted to the 
Earth’s rigid mantle and crust. 

3.3.44 Comparison with surface waves 

The higher-order free oscillations of the Earth are related 
directly to the two types of surface wave. 

(i) In a Rayleigh wave the particle vibration is polarized 
in the vertical plane and has radial and tangential 
components (see Fig. 3.13). The higher-order spher¬ 
oidal oscillations are equivalent to the standing wave 
patterns that arise from the interference of trains of 
long-period Rayleigh waves travelling in opposite 
directions around the Earth. 

(ii) In a Love wave the particle vibration is polarized hor¬ 
izontally. The toroidal oscillations may be regarded as 
the standing wave patterns due to the interference of 
oppositely travelling Love waves. 

The similarity between surface waves and higher-order 
natural oscillations of the Earth is evident in the varia¬ 
tions of displacement with depth. Like any vibration, a 
train of surface waves is made up of different modes. 
Theoretical analysis of surface waves shows that the 
amplitudes of different modes decay with depth in the 
Earth (Fig. 3.19) in an equivalent manner to the depth 
attenuation of natural oscillations (Fig. 3.18b). 

The periods of the normal modes of free oscillations 
were calculated before they were observed. They have 
long periods - the period of 0 S 0 is 20.5 minutes, that of 
0 T 2 is 43.9 minutes and that of 0 S 2 is 53.9 minutes - and 
pendulum seismographs are not suitable for recording 
them. Their recognition had to await the development of 
long-period seismographs. The spheroidal oscillations 
have a radial component of displacement and can be 
recorded with long-period, vertical-motion seismographs. 
Continually recording gravimeters used for the obser¬ 
vation of bodily Earth-tides also record the spheroidal 
oscillations but not the toroidal oscillations, which have 
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Fig. 3.19 The attenuation with depth of some low-order modes of Love 
waves. 


no vertical component. These usually must be recorded 
with an instrument that is sensitive to horizontal displace¬ 
ments, such as the strain meter designed by H. Benioff in 
1935. Long-period, horizontal-motion seismographs are 
capable of recording the toroidal oscillations induced by 
great earthquakes. 

Strain meter records of the November 4, 1952, magni¬ 
tude 8 earthquake in Kamchatka exhibited a long-period 
surface wave with a period of 57 minutes (Benioff, 1958). 
This is much longer than the known periods of travelling 
surface waves, and was interpreted as a free oscillation of 
the Earth. Several independent investigators, using bodily 
Earth-tide gravity meters and different kinds of seismo¬ 
graph, recorded long-period waves excited by the massive 
1960 earthquake in Chile (surface-wave magnitude M s = 
8.5, moment magnitude M w = 9.5). These were conclu¬ 
sively identified with spheroidal and toroidal oscillations. 

Figure 3.20 illustrates some of the normal modes of 
free oscillation of the Earth set up by the huge 2004 
Sumatra-Andaman Islands earthquake (M w = 9.0). 
Vertical motions were recorded during 240 hours by the 
Canberra, Australia, station of the Geoscope global 
network of digital broadband seismic stations (see 
Section 3.4.3.3). The power spectrum in Fig. 3.20 is a plot 
of the energy associated with different frequencies of 
vibration. Some spheroidal and toroidal free oscillations 
are identified and illustrated schematically. The splitting 
of some normal modes is caused by the non-spherical 
shape and rotation of the Earth. 

The study of the natural oscillations of the Earth set 
up by large earthquakes is an important branch of seis¬ 
mology, because the normal modes are strongly depen¬ 
dent on the Earth’s internal structure. The low-order 
modes are affected by the entire interior of the Earth, 
while the higher-order modes react primarily to move¬ 
ments of the upper mantle. The periods of free oscillation 
are determined by the radial distributions of elastic prop¬ 


erties and densities in the Earth. Comparison of the 
observed periods of different modes with values com¬ 
puted for different models of the Earth's velocity and 
density structure provides an important check on the 
validity of the Earth model. The oscillations are damped 
by the anelasticity of the Earth. The low-order mode 
oscillations with periods up to 40-50 minutes persist 
much longer than the higher-order modes with short 
periods of only a few minutes. By studying the decay 
times of different modes, a profile of the anelastic quality 
factor (0 within the Earth can be calculated that is com¬ 
patible with results obtained for body waves. 


3.4 THE SEISMOGRAPH 

3.4.1 Introduction 

The earliest known instrument for indicating the arrival 
of a seismic tremor from a distant source is reputed to 
have been invented by a Chinese astronomer called 
Chang Heng in 132 AD. The device consisted of eight 
inverted dragons placed at equal intervals around the rim 
of a vase. Under each dragon sat an open-mouthed metal 
toad. Each dragon held a bronze ball in its mouth. When 
a slight tremor shook the device, an internal mechanism 
opened the mouth of one dragon, releasing its bronze 
ball, which fell into the open mouth of the metal toad 
beneath, thereby marking the direction of arrival of the 
tremor. The principle of this instrument was used in 
eighteenth-century European devices that consisted of 
brimful bowls of water or mercury with grooved rims 
under which tiny collector bowls were placed to collect 
the overflow occasioned by a seismic tremor. These 
instruments gave visible evidence of a seismic event but 
were unable to trace a permanent record of the seismic 
wave itself. They are classified as seismoscopes. 

The science of seismology dates from the invention of 
the seismograph by the English scientist John Milne in 
1892. Its name derives from its ability to convert an unfelt 
ground vibration into a visible record. The seismograph 
consists of a receiver and a recorder. The ground vibra¬ 
tion is detected and amplified by a sensor, called the seis¬ 
mometer or, in exploration seismology, the geophone. In 
modern instruments the vibration is amplified and fil¬ 
tered electronically. The amplified ground motion is con¬ 
verted to a visible record, called the seismogram. 

The seismometer makes use of the principle of inertia. 
If a heavy mass is only loosely coupled to the ground (for 
example, by suspending it from a wire like a pendulum as 
in Fig. 3.21), the motion of the Earth caused by a seismic 
wave is only partly transferred to the mass. While the 
ground vibrates, the inertia of the heavy mass assures that 
it does not move as much, if at all. The seismometer 
amplifies and records the relative motion between the 
mass and the ground. 

Early seismographs were undamped and reacted only 
to a limited band of seismic frequencies. Seismic waves 
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Fig. 3.20 Spectrum of natural 
oscillations of the Earth 
following the magnitude 9.0 
Sumatra-Andaman 
earthquake of December 26, 
2004 (after Park eta!., 2005). 




ground moves to left ground moves to right 

Fig. 3.21 The principle of the seismometer. Because of its inertia, a 
suspended heavy mass remains almost stationary when the ground and 
suspension move to the left or to the right. 


with inappropriate frequencies were barely recorded at 
all, but strong waves could set the instrument into reso¬ 
nant vibration. In 1903, the German seismologist Emil 
Wiechert substantially increased the accuracy of the 
seismograph by improving the amplification method and 
by damping the instrument. These early instruments 
relied on mechanical levers for amplification and record¬ 
ing signals on smoked paper. This made them both bulky 
and heavy, which severely restricted their application. 

A major technological improvement was achieved in 
1906, when Prince Boris Galitzin of Russia introduced 
the electromagnetic seismometer, which allowed galvano- 
metric recording on photographic paper. This electrical 
method had the great advantage that the recorder could 
now be separated from the seismometer. The seismometer 
has evolved constantly, with improvements in seismome¬ 
ter design and recording method, culminating in modern 


broadband instruments with digital recording on mag¬ 
netic tape, hard disk, or solid state memory device. 

3.4.2 Principle of the seismometer 

Seismometers are designed to react to motion of the 
Earth in a given direction. Mechanical instruments 
record the amplified displacement of the ground; electro¬ 
magnetic instruments respond to the velocity of ground 
motion. Depending on the design, either type may 
respond to vertical or horizontal motion. Some modern 
electromagnetic instruments are constructed so as to 
record simultaneously three orthogonal components of 
motion. Most designs employ variations on the pendu¬ 
lum principle. 

3.4.2.1 Vertical-motion seismometer 

In the mechanical type of vertical-motion seismometer 
(Fig. 3.22a), a large mass is mounted on a horizontal bar 
hinged at a pivot so that it can move only in the vertical 
plane. A pen attached to the bar writes on a horizontal 
rotating drum that is fixed to the housing of the instru¬ 
ment. The bar is held in a horizontal position by a weak 
spring. This assures a loose coupling between the mass 
and the housing, which is connected rigidly to the ground. 
Vertical ground motion, as sensed during the passage of a 
seismic wave, is transmitted to the housing but not to 
the inertial mass and the pen, which remain stationary. 
The pen inscribes a trace of the vertical vibration of the 
housing on a paper fixed to the rotating drum. This trace 
is the vertical-motion seismogram of the seismic wave. 
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vertical 
motion 
of drum 



Fig. 3.22 Schematic diagrams illustrating the principle of operation of 
the vertical-motion seismometer: (a) mechanical pendulum type (after 
Strahler, 1963), (b) electromagnetic, moving-coil type. 



Fig. 3.23 Schematic design of the pendulum type of horizontal-motion 
seismometer (after Strahler, 1963). 


The electromagnetic seismometer responds to the rela¬ 
tive motion between a magnet and a coil of wire. One of 
these members is fixed to the housing of the instrument 
and thereby to the Earth. The other is suspended by a 
spring and forms the inertial member. Two basic designs 
are possible. In the moving-magnet type, the coil is fixed 
to the housing and the magnet is inertial. In the moving- 
coil type the roles are reversed (Fig. 3.22b). A coil of wire 
fixed to the inertial mass is suspended between the poles 
of a strong magnet, which in turn is fixed to the ground by 
the rigid housing. Any motion of the coil within the mag¬ 
netic field induces a voltage in the coil proportional to the 
rate of change of magnetic flux. During a seismic arrival 
the vibration of the ground relative to the mass is con¬ 
verted to an electrical voltage by induction in the coil. The 
voltage is amplified and transmitted through an electrical 
circuit to the recorder. 


this stable position. Similarly, the horizontal-motion seis¬ 
mometer swings about its equilibrium position like a hori¬ 
zontal pendulum (in fact it is the housing of the 
instrument that moves and not the inertial mass). As in the 
vertical-motion seismometer, a pen or light-beam attached 
to the stationary inertial mass writes on a rotating drum 
(which in this case has a horizontal axis) and records the 
relative motion between the mass and the instrument 
housing. The trace of the ground motion detected with 
this instrument is the horizontal-motion seismogram of 
the seismic wave. 

The design of an electromagnetic horizontal-motion 
seismometer is similar to that of the vertical-motion type, 
with the exception that the axis of the moving member 
(coil or magnet) is horizontal. 


34.2.2 Horizontal-motion seismometer 

The principle of the mechanical type of horizontal-motion 
seismometer is similar to that of the vertical-motion 
instrument. As before the inertial mass is mounted on a 
horizontal bar, but the fulcrum is now hinged almost verti¬ 
cally so that the mass is confined to swing sideways in a 
nearly horizontal plane (Fig. 3.23). The behavior of the 
system is similar to that of a gate when its hinges are 
slightly out of vertical alignment. If the hinge axis is tilted 
slightly forward, the stable position of the gate is where its 
center of mass is at its lowest point. In any displacement of 
the gate, the restoring gravitational forces try to return it to 


34.2.3 Strain seismometer 

The pendulum seismometers described above are inertial 
devices, which depend on the resistance of a loosely 
coupled mass to a change in its momentum. At about the 
same time that he developed the inertial seismograph, 
Milne also conducted experiments with a primitive 
strain seismograph that measured the change in distance 
between two posts during the passage of a seismic wave. 
The gain of early strain seismographs was low. However, 
in 1935 H. Benioff invented a sensitive strain seismograph 
from which modern versions are descended. 

The principle of the instrument is shown in Fig. 3.24. It 
can record only horizontal displacements. Two collinear 









3.4 THE SEISMOGRAPH 


143 


rigid pillar 
cemented 


electronically 
or optically 
monitored 



Fig. 3.24 Schematic design of a strain seismometer (after Press and 
Siever, 1985). 


horizontal rods made of fused quartz so as to be insensi¬ 
tive to temperature change are attached to posts about 
20 m apart, fixed to the ground; their near ends are sepa¬ 
rated by a small gap. The changes in separation of the two 
fixed posts result in changes in the gap width, which are 
detected with a capacitance or variable-reluctance trans¬ 
ducer. In modern instruments the variation in gap width 
may be observed optically, using the interference between 
laser light-beams reflected from mirrors attached to the 
opposite sides of the gap. The strain instrument is capable 
of resolving strains of the order of 10 -8 to 10 -10 . 


3.4.3 The equation of the seismometer 

Inertial seismometers for recording horizontal and verti¬ 
cal ground motion function on the pendulum principle. 
When the instrument frame is displaced from its equilib¬ 
rium position relative to the inertial mass, a restoring force 
arises that is, to first order, proportional to the displace¬ 
ment. Let the vertical or horizontal displacement, depen¬ 
dent on the type of seismometer, be u and the restoring 
force —ku, and let the corresponding displacement of the 
ground be q. The total displacement of the inertial mass 
M is then (u + q), and the equation of motion is 

M^(w + q) = - ku (3.77) 


We now divide throughout by M, write klM= co 0 2 , and 
after rearranging the equation, we get the familiar equa¬ 
tion of forced simple harmonic motion 


d 2 l< , 0 __ 

fp +( 0 o u ~ 


d 2 q 

St 2 


(3.78) 


In this equation co 0 is the natural frequency (or resonant 
frequency ) of the instrument. For a ground motion with 
this frequency, the seismometer would execute large 
uncontrolled vibrations and the seismic signal could 
not be recorded accurately. To get around this problem, 
the seismometer motion is damped by providing a 
velocity-dependent force that opposes the motion. A 
damping term enters into the equation of motion, which 
becomes 


d 2 lt 

dt 2 


+ 2A« 0 |y + COqU — 


d 2 q 

dt 2 


(3.79) 


The constant A in this equation is called the damping 
factor of the instrument. It plays an important role in deter¬ 
mining how the seismometer responds to a seismic wave. 

A seismic signal is generally composed of numerous 
superposed harmonic vibrations with different frequen¬ 
cies. We can determine how a seismometer with natural fre¬ 
quency to 0 responds to a seismic signal with any frequency 
co by solving Eq. (3.78) with q = A coswt (Box 3.2). Here, A 
is the magnified amplitude of the ground motion, equal to 
the true ground motion multiplied by a magnification 
factor that depends on the sensitivity of the instrument. 
Let the displacement u recorded by the seismometer be 

u = Ucos (cot — A) (3.80) 

where U is the amplitude of the recorded signal and A is 
the phase difference between the record and the ground 
motion. As derived in Box 3.2, the phase lag A is given by 


A = 


tan 1 


/ 2A coco 0 \ 


(3.81) 


The solution of the equation of motion (Eq. (3.79)) 
gives the displacement u on the seismic record as 


u = 


_ A co 2 _ 

[(w^ — co 2 ) 2 + 4A 2 co 2 cof\ m 


cos (cot — A) 


(3.82) 


3.4.3.1 Effect of instrumental damping 

The ground motion caused by a seismic wave contains a 
broad spectrum of frequencies. Equation (3.82) shows 
that the response of the seismometer to different signal 
frequencies is strongly dependent on the value of the 
damping factor A (Fig. 3.25). A completely undamped 
seismometer has A = 0, and for small values of A the 
response of the seismometer is said to be underdamped. 
An undamped or greatly underdamped seismometer pref¬ 
erentially amplifies signals near the natural frequency, 
and therefore cannot make an accurate record of the 
ground motion; the undamped instrument will resonate at 
its natural frequency co Q . For all damping factors A < 1 /y[2 
the instrument response function has a peak, indicating 
preferential amplification of a particular frequency. 

The value A = 1 corresponds to critical damping, so- 
called because it delineates two different types of seis¬ 
mometer response in the absence of a forcing vibration. If 
A < 1, the damped, free seismometer responds to a distur¬ 
bance by swinging periodically with decreasing amplitude 
about its rest position. If A > 1, the disturbed seismometer 
behaves aperiodically, moving smoothly back to its rest 
position. However, if the damping is too severe (A > 1), 
the instrument is overdamped and all frequencies in the 
ground motion are suppressed. 

The optimum behavior of a seismometer requires 
that the instrument should respond to a wide range of 
frequencies in the ground motion, without preferential 
amplification or excessive suppression of frequencies. 
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Box 3.2: The seismometer equation 

Equation (3.79) is the damped equation of motion for 
the signal u recorded by a seismometer with natural 
frequency co Q and damping factor A, when the ground 
displacement is q. Let the ground displacement be 
q = Acoscot and the recorded seismic signal be 
u = t/cos (cot - A). Substituting in Eq. (3.79) we get the 
following equation: 

—w 2 Ucos(cot — A) — 2A&>co 0 f/sin( cat — A) 

+ co^Ucos(cot — A) = Acu 2 coscot (1) 


i.e., 

U[(co — co 2 ) cos (cot — A) — 2Acaca 0 sin (cot — A)] 

= Aco 2 coscot (2) 

If we now write 

(cag — co 2 ) = Rcoscp 2\coco 0 = Rsincp (3) 

where 

/ 2Acaca n \ ,. 

R = [(cag — co 2 ) 2 + 4A 2 ca 2 cag] 1/2 tanip=l—^ ,1 (4) 

\CO q CO / 

the equations reduce to 

U[Rcoscpcos(cot — A) — R sin <p sin (cot — A)] 

= Aco 2 cos cot (5) 

(77?cos (cot — A + cp) = Acorcoscot (6) 

Complex numbers (see Box 2.6) allow a simple 
solution to this equation. The function cos 9 is the real 
part of the complex number (i.e., cost) = Reje 1 *}). 
Therefore, 

t/7?Re{e (i( " ,-A+lf ’ )) } = Aco 2 Re{e( iw '>} (7) 

U= ^Refe^he-W^-A+f))} = ^Re{eWA-fO)} 

= ^-cos(A-c p) (8) 

The maximum amplitude of the record, U, is when 
cos(A-<p) = l. The corresponding phase lag A 

between the recorded signal and the ground motion is 


A = 


tan 1 


/ 2A coco Q \ 


(9) 


and the equation for the amplitude of the seismic 
record is given by 


u = 


Aco 2 


[(cag — ca 2 ) 2 + 4A 2 ca 2 cag] 1/2 


COS (cot — A) 


( 10 ) 



Normalized frequency, ca/a > 0 


Fig. 3.25 Effect of the damping factor X on the response of a 
seismometer to different signal frequencies. Critical damping 
corresponds to X = 1. Satisfactory operation corresponds to a damping 
factor between 0.7 and 1 (i.e., 70-100% of critical damping). 


This requires that the damping factor should be close to 
the critical value. It is usually chosen to be in the range 
70% to 100% of critical damping (i.e., 1/V2<A<1). 
At critical damping the response of the seismometer to 
a periodic disturbing signal with frequency co is given 

by 


u — 


Aco 2 

(cog + <W 2 ) 


cos (cot - A) 


(3.83) 


3.43.2 Long-period and short-period seismometers 

The natural period (2ttIco 0 ) of a seismometer is an impor¬ 
tant factor in determining what it actually records. Two 
examples of special interest correspond to instruments 
with very long and very short natural periods, respec¬ 
tively. 

The long-period seismometer is an instrument in which 
the resonant frequency co Q is very low. For all but the 
lowest frequencies we can write co Q <§ co. The phase lag A 
between the seismometer and the ground motion 
becomes zero (see Box 3.2), and the amplitude of the seis¬ 
mometer displacement becomes equal to the amplified 
ground displacement q: 

u — A coscot = q (3.84) 

The long-period seismometer is sometimes called a 
displacement meter. It is usually designed to record 
seismic signals with frequencies of 0.01-0.1 Hz (i.e., 
periods in the range 10-100 s). 
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The short-period seismometer is constructed to have a 
very short natural period and a correspondingly high res¬ 
onant frequency w 0 , which is higher than most frequen¬ 
cies in the seismic wave. Under these conditions we have 
« 0 > w, the phase difference A is again small and Eq. 
(3.83) becomes 

to 2 1 

u = ^Acoswt = -j q (3.85) 

o>5 co 0 

This equation shows that the displacement of the 
short-period seismometer is proportional to the accelera¬ 
tion of the ground, and the instrument is accordingly 
called an accelerometer. It is usually designed to respond 
to seismic frequencies of 1-10 Hz (periods in the range 
0.1-1 s). An accelerometer is particularly suitable for 
recording strong motion earthquakes, when the ampli¬ 
tude of the ground motion would send a normal type of 
displacement seismometer off-scale. 

3.4.33 Broadband seismometers 

Short-period seismometers operate with periods of 
0.1 — 1 s and long-period instruments are designed for 
periods greater than 10 s. The resolution of seismic 
signals with intermediate frequencies of 0.1 — 1 Hz 
(periods of 1 — 10 s) is hindered by the presence in this 
range of a natural form of seismic background noise. 
The noise derives from a nearly continuous succession of 
small ground movements that are referred to as micro¬ 
seisms. Some microseismic noise is of local origin, 
related to such effects as vehicular traffic, rainfall, wind 
action on trees, etc. However, an important source is the 
action of storm waves at sea, which is detectable on 
seismic records far inland. The drumming of rough surf 
on a shoreline and the interference of sea waves over 
deep water are thought to be the principal causes of 
microseismic noise. The microseismic noise has a low 
amplitude on a seismogram, but it may be as strong as a 
weak signal from a distant earthquake, which cannot be 
selectively amplified without also magnifying the noise. 
The problem is exacerbated by the limited dynamic range 
of short- or long-period seismometers. Short-period 
instruments yield records dominated by high frequencies 
while long-period devices smooth these out, giving a 
record with only a low-frequency content (Fig. 3.26a). 

The range between the strongest and weakest signals 
that can be recorded without distortion by a given instru¬ 
ment is called its dynamic range. Dynamic range is mea¬ 
sured by the power (or energy density) of a signal, and is 
expressed in units of decibels (dB). A decibel is defined as 
10 log 10 (signal power). Because power is proportional to 
the square of amplitude (Section 3.3.2.6), a decibel is 
equivalent to 20 log 10 (amplitude). So, for example, a 
range of 20 dB in power corresponds to a factor 10 varia¬ 
tion in acceleration, and a dynamic range of 100 dB cor¬ 
responds to a 10 5 variation in amplitude. Short- and 
long-period seismometers have narrow dynamic ranges 




Period (s) 

Fig. 3.26 (a) Comparison of short-period and long-period records of a 
teleseismic P-wave with a broadband seismometer recording of the 
same event, which contains more information than the other two 
records separately or combined, (b) Ranges of the ground acceleration 
(in dB) and periods of ground motion encompassed by the very 
broadband seismic system of the IRIS Global Seismic Network, 
compared with the responses of short-period and long-period 
seismometers and expected ground accelerations from magnitude 5.0 
and 9.5 earthquakes (stars) and from bodily Earth-tides (redrawn from 
Lay and Wallace, 1995). 

because they are designed to give optimum performance 
in limited frequency ranges, below or above the band of 
ground noise. This handicap was overcome by the design 
of broadband seismometers that have high sensitivity 
over a very wide dynamic range. 

The broadband seismometer has basically an 
inertial pendulum-type design, with enhanced capability 
due to a force-feedback system. This works by applying 
a force proportional to the displacement of the inertial 
mass to prevent it from moving significantly. The 
amount of feedback force applied is determined using 
an electrical transducer to convert motion of the mass 
into an electrical signal. The force needed to hold the 
mass stationary corresponds to the ground acceleration. 
The signal is digitized with 16-to-24 bit resolution, syn¬ 
chronized with accurate time signals, and recorded on 
magnetic tape, hard disk, or solid state memory. The 
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feedback electronics are critical to the success of this 
instrument. 

Broadband design results in a seismometer with great 
bandwidth and linear response. It is no longer necessary 
to avoid recording in the 1-10 s bandwidth of ground 
noise avoided by short-period and long-period seismome¬ 
ters. The recording of an earthquake by a broadband seis¬ 
mometer contains more useable information than can be 
obtained from the short-period or long-period recordings 
individually or in combination (Fig. 3.26a). 

Broadband seismometers can be used to register a wide 
range of signals (Fig. 3.26b). The dynamic range extends 
from ground noise up to the strong acceleration that would 
result from an earthquake with magnitude 9.5, and the 
periods that can be recorded range from high-frequency 
body waves to the very long period oscillations of the 
ground associated with bodily Earth-tides (Section 
2.3.3.5). The instrument is employed world-wide in 
modern standardized seismic networks, replacing short- 
period and long-period seismometers. 

3.4.4 The seismogram 

A seismogram represents the conversion of the signal 
from a seismometer into a time record of a seismic event. 
The commonest method of obtaining a directly visible 
record, in use since the earliest days of modern seismol¬ 
ogy, uses a drum that rotates at a constant speed to 
provide the time axis of the record, as shown schemati¬ 
cally in Fig. 3.22a and Fig. 3.23. In early instruments a 
mechanical linkage provided the coupling between 
sensor and record. The invention of the electromagnetic 
seismometer by Galitzin allowed transmission of the 
seismic signal to the recorder as an electrical signal. For 
many years, a galvanometer was used to convert the 
electrical signal back to a mechanical form for visual 
display. 

In a galvanometer a small coil is suspended on a fine 
wire between the poles of a magnet. The current in the 
coil creates a magnetic field that interacts with the field of 
the permanent magnet and causes a deflection of the coil. 
The electrical circuitry of the galvanometer is designed 
with appropriate damping so that the deflection of the 
galvanometer is a faithful record of the seismic signal. 
The deflection can be transferred to a visible record in a 
variety of ways. 

Mechanical and electromagnetic seismometers deliver 
continuous analog recordings of seismic events. These 
types of seismometer are now of mainly historic interest, 
having been largely replaced by broadband seismometers. 
Galvanometer-based analog recording has been super¬ 
seded by digital recording. 

3.4.4.1 Analog recording 

In an early method of recording, a smoked paper sheet 
was attached to the rotating drum. A fine stylus was 


connected to the pendulum by a system of levers. The 
point of the stylus scratched a fine trace on the smoked 
paper. Later instruments employed a pen instead of the 
stylus and plain paper instead of the smoked paper. These 
methods make a “ wiggly line" trace of the vibration. 

In a further development of galvanometric recording a 
light-beam was reflected from a small mirror attached to 
the coil or its suspension to trace the record on photo¬ 
graphic paper attached to the rotating drum. Photo¬ 
graphic methods of recording are free of the slight 
friction of mechanical contact and allow inventive modi¬ 
fications of the form of the record. In the variable density 
method, the galvanometer current modulated the inten¬ 
sity of a light-bulb, so that the fluctuating signal showed 
on the photographic record as successive light and dark 
bands. A variable area trace was obtained by using the 
galvanometer current to vary the aperture of a narrow 
slit, through which a light-beam passed on to the photo¬ 
graphic paper. 

Every seismogram carries an accurate record of the 
elapsed time. This may be provided by a tuned electrical 
circuit whose frequency of oscillation is controlled by the 
natural frequency of vibration of a quartz crystal. At 
regular intervals the timing system delivers a short 
impulse to the instrument, causing a time signal to be 
imprinted on the seismogram. Modern usage is to employ 
a time signal transmitted by radio. The timing lines 
appear as regularly spaced blips on a wiggly line trace, or 
as bright lines on photographic records. 

An important development after the early 1950s, espe¬ 
cially in commercial seismology, was the replacement of 
photographic recording by magnetic recording. The elec¬ 
trical current from the seismometer was sent directly to 
the recording head of a tape recorder. The varying 
current in the recording head imprinted a corresponding 
magnetization on a magnetic tape. Magnetic recorders 
might have 24 to 50 parallel channels, each able to record 
from a different source. 

Many years of development resulted in sophisticated 
methods of analog recording, but they have now been 
superseded universally by digital methods. 

3.4.42 Digital recording 

In digital recording, the analog signal from a seismome¬ 
ter is passed through an electronic device called an 
analog-to-digital converter, which samples the continu¬ 
ous input signal at discrete, closely spaced time intervals 
and represents it as a sequence of binary numbers. 
Conventional representation on the familiar decimal 
scale expresses a number as a sum of powers of 10 multi¬ 
plied by the digits 0—9. In contrast, a number is repre¬ 
sented on the binary scale as a sum of powers of 2 
multiplied by one of only two digits, 0 or 1. For example, 
in decimal notation the number 153 represents 1 X 10 2 + 
5 X 10 1 + 3 X 10°. In binary notation, the same number is 
represented as 



3.4 THE SEISMOGRAPH 


147 


153 = 128+ 16 + 8 + 1 

= 1 x2 7 + 0 x 2 6 + 0 x 2 5 + 1 x2 4 + 1 x2 3 
+ 0x2 2 + 0x2 1 + 1x2° 

= 10011001 . 

Each of the digits in a binary number is called a bit and 
the combination that expresses the digitized quantity is 
called a word, the binary number 10011001 is an eight-bit 
word. A binary number is much longer and thus more 
cumbersome for everyday use than the decimal form, but 
it is suitable for computer applications. Because it involves 
only two digits, a binary number can be represented by a 
simple condition, such as whether a switch is off or on, or 
the presence or absence of an electrical voltage or current. 

Digital recording of seismic signals was developed in 
the 1960s and since the early 1970s it has virtually 
replaced analog recording. The analog method records 
the signal, usually employing the galvanometer principle, 
on photographic film or on magnetic tape as a continuous 
time-varying voltage whose amplitude is proportional to 
a characteristic of the ground disturbance (displacement, 
velocity or acceleration). The dynamic range of the 
analog method is limited and the system has to be 
adapted specifically to the characteristics of the signal to 
be recorded. For example, an analog device for recording 
strong signals lacks the sensitivity needed to record weak 
signals, whereas an analog recorder of weak signals will 
be overloaded by a strong signal. The digital recording 
technique samples the amplified output of the seismome¬ 
ter at time increments of a millisecond or so, and writes 
the digitized voltage directly to magnetic tape or to a 
computer hard-disk. This avoids possible distortion of 
the signal that can result from mechanical or optical 
recording. Digital recording has greater fidelity than the 
analog method, its dynamic range is wide, and the data 
are recorded in a form suitable for numerical processing 
by high-speed computers. 

After processing, the digital record is usually con¬ 
verted back to an analog form for display and interpreta¬ 
tion. The processed digital signal is passed through a 
digital-to-analog converter and displayed as a wiggle 
trace or variable density record. The familiar continuous 
paper record is still a common form of displaying earth¬ 
quake records. However, instead of employing gal¬ 
vanometers to displace the pen in response to a signal, 
modern devices utilize a motor-driven pen; in this case the 
electrical signal from the earthquake record powers a 
small servo-motor which controls the pen displacement 
on the paper record. 

3.44.3 Phases on a seismogram 

The seismogram of a distant earthquake contains the 
arrivals of numerous seismic waves that have travelled 
along different paths through the Earth from the source to 
the receiver. The appearance of the seismogram can there¬ 
fore be very complicated, and its interpretation demands 



Fig. 3.27 Broadband seismograms of an earthquake in Peru recorded 
at Harvard, Massachusetts. Top'. the SH body wave and Love (L Q ) surface 
wave are prominent on the horizontal component record; bottom', the P 
and SV body waves and the Rayleigh (L R ) surface waves are clear on the 
vertical component record. Both seismograms also show several other 
phases (redrawn from Lay and Wallace, 1995). 


considerable experience. The analysis of seismic waves 
that have been multiply reflected and refracted in the 
Earth’s interior will be treated in Section 3.7. Each event 
that is recorded in the seismogram is referred to as a phase. 

As described in Section 3.3.2.1, the fastest seismic waves 
are the longitudinal waves. The first phase on the seismo¬ 
gram corresponds to the arrival of a longitudinal body 
wave, identified as the primary wave, or P-wave (Fig. 3.27). 
The next phase is the bodily shear wave, referred to as the 
secondary wave or S-wave, which usually has a larger 
amplitude than the P-wave. It is followed by the large- 
amplitude disturbance associated with surface waves, 
which are sometimes designated L-waves because of their 
much longer wavelengths. Dispersion (Section 3.3.3.3) 
causes the wavelengths at the head of the surface-wave 
train to be longer than those at the tail. Conventionally, 
Rayleigh waves are referred to as L R waves, while Love 
waves are called L Q waves. 

The arrivals recorded on any seismogram depend on 
the type of sensor used. For example, a vertical-compo¬ 
nent seismometer responds to P-, SV- and Rayleigh waves 
but does not register SH- or Love waves; a horizontal- 
component seismometer can register P-, SH-, Rayleigh 
and Love waves. The amplitudes of the different phases 
on a seismogram are influenced by several factors: the ori¬ 
entation of the instrument axis to the wave path, the epi- 
central distance (see Section 3.5.2), the focal mechanism 
and the structure traversed by the waves. 

Representative seismograms for a distant earthquake 
are shown in Fig. 3.27. They were recorded at Harvard, 
Massachusetts, for an earthquake that occurred deep 
beneath Peru on May 24, 1991. The seismic body waves 
from this earthquake have travelled through deep regions 
of the mantle and several seismic phases are recorded. 
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The upper seismogram is the trace of a horizontal-com¬ 
ponent seismometer oriented almost transverse to the 
seismic path, so the P-wave arrival is barely discernible. 
The first strong signal is the S body wave (in this case, an 
SH-wave), closely followed by several other phases 
(defined in Section 3.7.2) and the Love (L Q ) surface-wave 
train. The lower seismogram, recorded by a vertical- 
component seismometer, shows the arrivals of P and SV 
body waves and the Rayleigh (L R ) surface-wave train. 
Love waves travel along the surface at close to the near¬ 
surface S-wave velocity (K L q~/ 3); Rayleigh waves are 
slower, having L lr = 0.92/3, so they reach the recording 
station later than the Love waves. 


3.5 EARTHQUAKE SEISMOLOGY 

3.5.1 Introduction 

Most of the earthquakes which shake the Earth each year 
are so weak that they are only registered by sensitive 
seismographs, but some are strong enough to have 
serious, even catastrophic, consequences for mankind 
and the environment. About 90% of all earthquakes 
result from tectonic events, primarily movements on 
faults. The remaining 10% are related to volcanism, col¬ 
lapse of subterranean cavities, or man-made effects. 

Our understanding of the processes that lead to earth¬ 
quakes derives to a large extent from observations of 
seismic events on the San Andreas fault in California. The 
average relative motion of the plates adjacent to the San 
Andreas fault is about 5 cm yr -1 , with the block to the 
west of the fault moving northward. On the fault-plane 
itself, this motion is not continuous but takes place spas¬ 
modically. According to modern plate tectonic theory this 
extensively studied fault system is a transform fault. This 
is a rather special type, so it cannot be assumed that the 
observations related to the San Andreas fault are applica¬ 
ble without reservation to all other faults. However, the 
elastic rebound model, proposed by H. F. Reid after the 
1906 San Francisco quake, is a useful guide to how an 
earthquake may occur. 

The model is illustrated in Fig. 3.28 by the changes to 
five parallel lines, drawn normal to the trace of the fault in 
the unstrained state and intersecting it at the points A-E. 
Strain due to relative motion of the blocks adjacent to the 
fault accumulates over several years. Far from the trace of 
the fault the five lines remain straight and parallel, but 
close to it they are bent. When the breaking point of the 
crustal rocks at C is exceeded, rupture occurs and there is a 
violent displacement on the fault-plane. The relative dis¬ 
placement that has been taking place progressively 
between the adjacent plates during years or decades is 
achieved on the fault-plane in a few seconds. The strained 
rocks adjacent to the fault “rebound” suddenly. The accu¬ 
mulated strain energy is released with the seismic speed of 
the ruptured rocks, which is several kilometers per second. 
The segments BC and C'D undergo compression, while 
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Fig. 3.28 Elastic rebound model of the origin of earthquakes: (a) 
unstrained state of a fault segment, (b) accumulation of strain close to 
the fault due to relative motion of adjacent crustal blocks, and (c) 
"rebound" of strained segment as an earthquake with accompanying 
release of seismic energy. 


CD and BC' experience dilatation. The points A and E do 
not move; the stored strain energy at these points is not 
released. The entire length of the fault-plane is not dis¬ 
placed, only the region in which the breaking point has 
been exceeded. The greater the length of the fault-plane 
that is activated, the larger is the ensuing earthquake. 

The occurrence of a large earthquake is not necessarily 
as abrupt as described in the preceding paragraph, 
although it can be very sudden. In 1976 a major earth¬ 
quake with magnitude 7.8 struck a heavily populated area 
of northern China near the city of Tangshan. Although 
there were known faults in the area, they had long been 
seismically inactive, and the large earthquake struck 
without warning. It completely devastated the industrial 
region and caused an estimated 243,000 fatalities. 
However, in many instances the accumulating strain is 
partially released locally as small earthquakes, or fore¬ 
shocks. This is an indicator that strain energy is building 
up to the rupture level and is sometimes a premonition 
that a larger earthquake is imminent. 

When an earthquake occurs, most of the stored energy 
is released in the main shock. However, for weeks or 
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Fig. 3.29 Vertical section perpendicular to the plane of a normal fault, 
defining the epicenter and hypocenter (focus) of an earthquake. 

months after a large earthquake there may be numerous 
lesser shocks, known as aftershocks, some of which can be 
comparable in size to the main earthquake. Structures 
weakened by the main event often collapse in large after¬ 
shocks, which can cause physical damage as severe as the 
main shock. The death toll from aftershocks is likely to be 
less, because people have evacuated damaged structures. 

Although in fact the earthquake involves a part of the 
fault-plane measuring many square kilometers in area, 
from the point of view of an observer at a distance of 
hundreds or even thousands of kilometers the earth¬ 
quake appears to happen at a point. This point is called 
the focus or hypocenter of the earthquake (Fig. 3.29). It 
generally occurs at a focal depth many kilometers below 
the Earth’s surface. The point on the Earth’s surface 
vertically above the focus is called the epicenter of the 
earthquake. 

3.5.2 Location of the epicenter of an earthquake 

The distance of a seismic station from the epicenter of an 
earthquake (the epicentral distance) may be expressed 
in kilometers A km along the surface, or by the angle 
A o = (180/TT)(A km /i?) subtended at the Earth’s center. The 
travel-times of P- and S-waves from an earthquake through 
the body of the Earth to an observer are dependent on the 
epicentral distance (Fig. 3.30a). The travel-time versus dis¬ 
tance plots are not linear, because the ray paths of waves 
travelling to distant seismographs are curved. However, the 
standard seismic velocity profile of the Earth’s interior is 
well enough known that the travel-times for each kind of 
wave can be tabulated or graphed as a function of epicen¬ 
tral distance. In computing epicentral distance from earth¬ 
quake data the total travel-time is not at first known, 
because an observer is rarely at the epicenter to record the 
exact time of occurrence t () of the earthquake. However, 
the difference in travel-times for P- and S-waves (t c -1 J can 
be obtained directly from the seismogram; it increases with 
increasing epicentral distance (Fig. 3.30a). 

For local earthquakes we can assume that the seismic 
velocities a and (3 are fairly constant in the near-surface 
layers. The time when the earthquake occurred, t 0 , can 
then be obtained by plotting the differences {t s - tf 



Fig. 3.30 (a) Travel-times of P- and S-waves from an earthquake 
through the body of the Earth to an observer at epicentral distances up 
to 90°. The epicentral distance (A) of the earthquake is found from the 
difference in travel times (f s - f). (b) Wadati diagram for determining 
the time of occurrence of an earthquake. 

against the arrival times / of the P-wave at different sta¬ 
tions. The plot, called a Wadati diagram, is a straight line 
(Fig. 3.30b). If D is the distance travelled by the seismic 
wave, the travel-times of P- and S-waves are respectively 
t = Dla and t = DIB, so 

P ® 



The intercept t 0 of the straight line with the arrival¬ 
time axis is the time of occurrence of the earthquake and 
the slope of the line is [(al/3) - 1], Knowing the P-wave 
velocity a, the distance to the earthquake is obtained 
from D = a(t p -1 () ). 

In order to determine the location of an earthquake, 
epicenter travel-times of P- and S-waves to at least three 
seismic stations are necessary (Fig. 3.31). The data from 
one station give only the distance of the epicenter from 
that station. It could lie anywhere on a circle centered at 
the station. The data from an additional station define a 
second circle which intersects the first circle at two points, 
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Fig. 3.31 Location of an 
earthquake epicenter using 
epicentral distances of three 
seismic stations (at A, B and 
C). The epicentral distance of 
each station defines the 
radius of a circle centered on 
the station. The epicenter 
(triangle) is located at the 
common intersection of the 
circles; their oval appearance 
is due to the map projection. 



each of which could be the true epicenter. Data from a 
third station remove the ambiguity: the common point of 
intersection of the three circles is the epicenter. 

Generally the circles do not intersect at a point, but 
form a small spherical triangle. The optimum location of 
the epicenter is at the center of the triangle. If data are 
available from more than three seismic stations, the epicen¬ 
tral location is improved; the triangle is replaced by a small 
polygon. This situation arises in part from observational 
errors, and because the theoretical travel-times are imper¬ 
fectly known. The interior of the Earth is neither homoge¬ 
neous nor isotropic, as must be assumed. The exact 
location of earthquake epicenters requires detailed knowl¬ 
edge of the seismic velocities along the entire path, but 
especially under the source area and the receiving station. 
The main reason for the intersection triangle or polygon is, 
however, that the seismic rays travel to the seismograph 
from the focus, and not from the epicenter. The focal depth 
of the earthquake, d, which may be up to several hundred 
kilometers, must be taken into account. It can be estimated 
from simple geometry. If A km is the epicentral distance and 
D the distance travelled by the wave, to a first approxima¬ 
tion d = ( D 2 - A km 2 ) 1/2 . Combining several values of d from 
different recording stations gives a reasonable estimate of 
the focal depth. 

3.5.3 Global seismicity 

The epicenters of around 30,000 earthquakes are now 
reported annually by the International Seismological 
Center. The geographical distribution of world seismicity 
(see Fig. 1.10) dramatically illustrates the tectonically 
active regions of the Earth. The seismicity map is impor¬ 
tant evidence in support of plate tectonic theory, and 
delineates the presently active plate margins. 

Earthquake epicenters are not uniformly distributed 
over the Earth’s surface, but occur predominantly along 
rather narrow zones of inter plate seismic activity. The 


circum-Pacific zone, in which about 75-80% of the annual 
release of seismic energy takes place, forms a girdle that 
encompasses the mountain ranges on the west coast of the 
Americas and the island arcs along the east coast of Asia 
and Australasia. The Mediterranean-transasiatic zone, 
responsible for about 15-20% of the annual seismic energy 
release, begins at the Azores triple junction in the Atlantic 
Ocean and extends along the Azores-Gibraltar ridge; 
after passing through North Africa it makes a loop 
through the Italian peninsula, the Alps and the Dinarides; 
it then runs through Turkey, Iran, the Himalayan moun¬ 
tain chain and the island arcs of southeast Asia, where it 
terminates at the circum-Pacific zone. The system of 
oceanic ridges and rises forms the third most active zone of 
seismicity, with about 3-7% of the annually released 
seismic energy. In addition to their seismicity, each of 
these zones is also characterized by active volcanism. 

The remainder of the Earth is considered to be aseis- 
mic. However, no region of the Earth can be regarded as 
completely earthquake-free. About 1% of the global seis¬ 
micity is due to intraplate earthquakes, which occur 
remote from the major seismic zones. These are not neces¬ 
sarily insignificant: some very large and damaging earth¬ 
quakes (e.g. the New Madrid, Missouri, earthquakes of 
1811 and 1812 in the Mississippi river valley) have been of 
the intraplate variety. 

Earthquakes can also be classified according to their 
focal depths. Earthquakes with shallow focal depths, less 
than 70 km, occur in all the seismically active zones; only 
shallow earthquakes occur on the oceanic ridge systems. 
The largest proportion (about 85%) of the annual release 
of seismic energy is liberated in shallow-focus earth¬ 
quakes. The remainder is set free by earthquakes with 
intermediate focal depths of 70-300 km (about 12%) and 
by earthquakes with deep focal depths greater than 300 km 
(about 3%). These occur only in the circum-Pacific and 
Mediterranean-transasiatic seismic zones, and accom¬ 
pany the process of plate subduction. 









3.5 EARTHQUAKE SEISMOLOGY 


151 


Fig. 3.32 Schematic cross- 
section through a subduction 
zone. The most active region 
is the zone of contact 
between the converging 
plates at depths of 10-60 km. 
There may be a "back-arc" 
seismic zone in the overriding 
plate. Below about 70 km 
depth a Wadati-Benioff 
seismic zone is described 
within the subducting plate 
(after Isacks, 1989). 


seismic zone 

at plate volcanic 



The distributions of epicentral locations and focal 
depths of intermediate and deep earthquakes give impor¬ 
tant evidence for the processes at a subduction zone. 
When the earthquake foci along a subduction zone are 
projected onto a cross-section normal to the strike of the 
plate margin, they are seen to define a zone of seismicity 
about 30-40 km thick in the upper part of the 80-100 km 
thick subducting oceanic plate, which plunges at roughly 
30-60° beneath the overriding plate (Fig. 3.32). For many 
years the inclined seismic zone was referred to in Western 
literature as a Benioff zone in recognition of the 
Californian scientist, Hugo Benioff. In the years follow¬ 
ing World War II Benioff carried out important pioneer¬ 
ing studies that described the distribution of deep 
earthquakes on steeply dipping surfaces of seismicity. 
Many characteristics of the occurrence of deep earth¬ 
quakes had been described in the late 1920s by a Japanese 
seismologist, Kiyoo Wadati. He discovered that the closer 
the epicenters of earthquakes lay to the Asian continent, 
the greater were their focal depths; the deep seismicity 
appeared to lie on an inclined plane. It was Benioff, 
however, who in 1954 proposed as an explanation of the 
phenomenon that the ocean floor was being “subducted” 
underneath the adjacent land. This was a bold proposal 
well in advance of the advent of plate tectonic theory. 
Today the zone of active seismicity is called a 
Wadati-Benioff zone in recognition of both discoverers. 

In three dimensions the Wadati-Benioff zone consti¬ 
tutes an inclined slab dipping underneath the overriding 
plate. It marks the location and orientation of the upper 
surface of the subducting plate. The dip-angle of the zone 
varies between about 30° and 60°, becoming steeper with 
increasing depth, and it can extend to depths of several 
hundred kilometers into the Earth. The deepest reliably 


located focal depths extend down to about 680 km. 
Important changes in the crystalline structure of mantle 
minerals occur below this depth. 

The structure of a subducting plate is not always as 
simple as described. A detailed study of the subducting 
Pacific plate revealed a double Wadati-Benioff zone 
under northeast Honshu, Japan (Fig. 3.33). The seismic¬ 
ity at depths below 100 km defines two parallel planes 
about 30-40 km apart. The upper plane, identified with 
the top of the subducting plate, is in a state of compres¬ 
sion; the lower plane, in the middle of the slab, is in a state 
of extension. These stress states are the result of unbend¬ 
ing of the subducting plate, which had previously under¬ 
gone sharp bending at shallow depth below the trench 
axis. This information is inferred from analysis of the 
mechanisms by which the earthquakes occur. 

3.5.4 Analysis of earthquake focal mechanisms 

During an earthquake the accumulated elastic energy is 
released suddenly by physical displacement of the ground, 
as heat and as seismic waves that travel outwards from the 
focus. By studying the first motions recorded by seismo¬ 
graphs at distant seismic stations, the focal mechanism of 
the earthquake can be inferred and the motion on the 
fault-plane interpreted. 

Consider a vertical section perpendicular to the plane 
of a normal fault on which the hypocenter of an earth¬ 
quake is located at the point H (Fig. 3.34). When the 
region above the fault moves up-slope, it produces a 
region of compression ahead of it and a region of dilata¬ 
tion (or expansion) behind it. In conjunction with the 
compensatory down-slope motion of the lower block, the 
earthquake produces two regions of compression and 
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Fig. 3.33 Distribution of 
microearthquakes in a vertical 
section across a double 
subduction zone under the 
island of Honshu, Japan. 
Below 100 km the seismicity 
defines two parallel planes, 
one at the top of the 
subducting plate, the other in 
the middle. The upper plane is 
in a state of compression, the 
lower is under extension (after 
Hasegawa, 1989). 
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Fig. 3.34 Regions of compression and dilatation around an earthquake 
focus, separated by the fault-plane and the auxiliary plane. 

two regions of dilatation surrounding the hypocenter. 
These are separated by the fault-plane itself, and by an 
auxiliary plane through the focus and normal to the fault- 
plane. When a seismic P-wave travelling out from a region 
of compression reaches an observer at C, its first effect is 
to push the Earth’s surface upwards; the initial effect of a 
P-wave that travels out from a region of dilatation to an 
observer at D is to tug the surface downwards. The P- 
wave is the earliest seismic wave to reach a seismograph at 
C or D and therefore the initial motion recorded by the 
instrument allows us to distinguish whether the first 
arrival was compressional or dilatational. 

3.5.4.1 Single-couple and double-couple radiation patterns 

The amplitudes of P-waves and S-waves vary with distance 
from their source because of the effects of physical 
damping and geometric dispersion. The amplitudes also 
depend geometrically on the angle at which the seismic ray 
leaves the source. This geometric factor can be calculated 
mathematically, assuming a model for the source mecha¬ 
nism. The simplest is to represent the source by a single pair 
of antiparallel motions. Analysis of the amplitude of the 
P-wave as a function of the angle d between a ray and the 
plane of the fault (Fig. 3.35a) gives an equation of the form 
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Fig. 3.35 Azimuthal patterns of amplitude variation for (a) single¬ 
couple and (b) double-couple earthquake source models are the same 
for P-waves but differ for S-waves. The radius of each pattern at 
azimuth 0 from the fault-plane is proportional to the amplitude of the 
seismic wave in this direction. For P-waves the fault plane and auxiliary 
plane are nodal planes of zero displacement. The maximum 
compressional amplitude is along the T-axis at an angle of 45° to the 
fault plane. The dilatational amplitude is maximum along the P-axis, 
also at 45° to the fault plane. 

A(r,t,a,9 ) = A 0 (r,t,a)sin 2 2d (3.87) 

in which A 0 (r, t, a) describes the decrease in amplitude 
with distance r, time t, and seismic P-wave velocity a. A 
plot of the amplitude variation with d is called the radia¬ 
tion pattern of the P-wave amplitude, which for the 
single-couple model has a quadrupolar character (Fig. 
3.35a). It consists of four lobes, two corresponding to the 
angular variation of amplitude where the first motion 
is compressional, and two where the first motion is 
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Fig. 3.36 Method of 
determining the focal 
mechanism of an earthquake, 
(a) The focal sphere 
surrounding the earthquake 
focus, with two rays S, and S 2 
that cut the sphere at P, and 
P 2 , respectively, (b) The points 
P 1 and P 2 are plotted on a 
lower-hemisphere stereogram 
as first-motion pushes (solid 
points) or tugs (open points), 
(c) The best-fitting great 
circles define regions of 
compression (shaded) and 
tension (unshaded). The P- 
and T-axes are located on the 
bisectors of the angles 
between the fault-plane and 
auxiliary plane. 



dilatational. The lobes are separated by the fault-plane 
and the auxiliary plane. 

The radiation pattern for S-waves from a single-couple 
source is described by an equation of the form 

B(r,t,(3,6) = B 0 (r,t,(3)sin 2 6 (3.88) 

where the amplitude B is now dependent on the S-wave 
velocity /3. The radiation pattern has a dipolar character 
consisting of two lobes in which the first motions are of 
opposite sense. 

An alternative model of the earthquake source is to 
represent it by a pair of orthogonal couples (Fig. 3.35b). 
The double-couple source gives the same form of radia¬ 
tion pattern for P-waves as the single-couple source, but 
the radiation pattern for S-waves is quadrupolar instead 
of dipolar. This difference in the S-wave characteristics 
enables the seismologist to determine which of the two 
earthquake source models is applicable. S-waves arrive 
later than P-waves, so their first motions must be resolved 
from the background noise of earlier arrivals. They can 
be observed and are consistent with the double-couple 
model. 

Note that the maximum P-wave amplitudes occur at 
45° to the fault plane. The directions of maximum ampli¬ 
tude in the compressional and dilatational fields define 
the T-axis and P-axis, respectively. Here T and P imply 
‘'tension” and “compression,” respectively, the stress con¬ 
ditions before faulting. Geometrically the P- and T-axes 
are the bisectors of the angles between the fault-plane 
and auxiliary plane. The orientations of these axes and of 
the fault-plane and auxiliary plane can be obtained even 
for distant earthquakes by analyzing the directions of 
first motions recorded in seismograms of the events. The 
analysis is called a fault-plane solution, or focal mecha¬ 
nism solution. 


3.5.42 Fault-plane solutions 

The ray path along which a P-wave travels from an earth¬ 
quake to the seismogram is curved because of the variation 
of seismic velocity with depth. The first step in the fault- 
plane solution is to trace the ray back to its source. A ficti¬ 
tious small sphere is imagined to surround the focus (Fig. 
3.36a) and the point at which the ray intersects its surface is 
computed with the aid of standardized tables of seismic P- 
wave velocity within the Earth. The azimuth and dip of the 
angle of departure of the ray from the earthquake focus 
are calculated and plotted as a point on the lower hemi¬ 
sphere of the small sphere. This direction is then projected 
onto the horizontal plane through the epicenter. The pro¬ 
jection of the entire lower hemisphere is called a stere¬ 
ogram. The direction of the ray is marked with a solid 
point if the first motion was a push away from the focus 
(i.e., the station lies in the field of compression). An open 
point indicates that the first motion was a tug towards the 
focus (i.e., the station lies in the field of dilatation). 

First-motion data of any event are usually available 
from several seismic stations that lie in different directions 
from the focus. The solid and open points on the stere¬ 
ogram usually fall in distinct fields of compression and 
dilatation (Fig. 3.36b). Two mutually orthogonal planes 
are now drawn so as to delineate these fields as well as 
possible. The fit is best made mathematically by a least- 
squares technique, but often a visual fit is obvious and 
sufficient. The two mutually orthogonal planes corre¬ 
spond to the fault-plane and the auxiliary plane, although 
it is not possible to decide which is the active fault-plane 
from the seismic data alone. The regions of the stere¬ 
ogram corresponding to compressional first motions are 
usually shaded to distinguish them from the regions of 
dilatational first motions (Fig. 3.36c). The P- and T-axes 
are the lines that bisect the angles between the fault-plane 
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and auxiliary plane in the fields of dilatation and com¬ 
pression, respectively. To attach a physical meaning to the 
P- and T-axes we will have to take a closer look at the 
mechanics of faulting. 

3.5.43 Mechanics of faulting 

As discussed in elasticity theory, the state of stress can be 
represented by the magnitudes and directions of the three 
principal stresses a l >a 2 >(T r The directions of these 
principal stresses are by definition parallel to the coordi¬ 
nate axes and are therefore positive for tensional stress. 
The theory of faulting of homogeneous materials has 
been developed by studying the failure of materials under 
compressional stress, which is directed inwards toward the 
origin of the coordinate axes. The minimum tensional 
stress corresponds to the maximum compressional stress, 
and vice versa. The reason for taking this view is that geol¬ 
ogists are interested in the behavior of materials within the 
Earth, where pressure builds up with increasing depth and 
faulting occurs under high confining pressures. 

We can combine both points of view if we consider 
stress to consist of a part that causes change of volume 
and a part that causes distortion. The first of these is 
called the hydrostatic stress, and is defined as the mean 
(cr m ) of the three principal stresses: <r m = (o-j + cr 2 + <r 3 )l 3. 
If we now subtract this value from each of the principal 
stresses we get the deviatoric stresses: cr ', > cr' 2 >cr' y Note 
that cr' j is positive; it is a tensional stress, directed 
outward. However, <r' 3 is negative; it is a compressional 
stress, directed inward. 

Failure of a material occurs on the plane of maximum 
shear stress. For a perfectly homogeneous material this is 
the plane that contains the intermediate principal stress 
axis ( t 2 (or erf). The fault-plane is oriented at 45° to the 
axes of maximum and minimum compressional stress, i.e., 
it bisects the angle between the erf and <r' 3 axes. In real 
materials inhomogeneity and the effects of internal fric¬ 
tion result in failure on a fault-plane inclined at 20°-30° to 
the axis of maximum compression. Seismologists often 
ignore this complication when they make fault-plane 
analyses. The axis of maximum tensional stress erf is 
often equated with the T-axis, in the field of compressional 
first motions on the bisector of the angle between the two 
principal planes. The axis of maximum compressional 
stress erf is equated with the P-axis, in the field of dilata- 
tional first motions. In reality the direction of cr' 3 will he 
between the P-axis and the fault plane. 

The locations of P and T may at first seem strange, for 
the axes appear to lie in the wrong quadrants. However, 
one must keep in mind that the orientations of the princi¬ 
pal stress axes correspond to the stress pattern before 
the earthquake, while the fault-plane solution shows the 
ground motions that occurred after the earthquake. The 
focal mechanism analysis makes it possible to interpret 
the directions of the principal axes of stress in the Earth’s 
crust that led to the earthquake. 



TTj (b) reverse fault 


(c) strike-slip fault 


Fig. 3.37 The three main types of fault and their focal mechanisms. 

Left the orientations of each fault-plane and the principal deviatoric 
stresses, <r , and a 3 . Right: focal mechanisms and orientations of P- 
and T- axes. 

There are only three types of tectonic fault. These can 
be distinguished by the orientations of the principal axes 
of stress to the horizontal plane (Fig. 3.37). The focal 
solutions of earthquakes associated with each type of 
fault have characteristic geometries. When motion on the 
fault occurs up or down the fault plane it is called a 
dip-slip fault, and when the motion is horizontal, parallel 
to the strike of the fault, it is called a strike-slip fault. 

Two classes of dip-slip fault are distinguished depend¬ 
ing on the sense of the vertical component of motion. In a 
normal fault, the block on the upper side of the fault 
drops down an inclined plane of constant steepness rela¬ 
tive to the underlying block (Fig. 3.37a). The correspond¬ 
ing fault-plane solution has regions of compression at the 
margins of the stereogram. The T-axis is horizontal and 
the P-axis is vertical. A special case of this type is the 
listric fault, in which the steepness of the fault surface is 
not constant but decreases with increasing depth. 

In the second type of dip-slip fault, known as a reverse 
fault or thrust fault, the block on the upper side of the fault 
moves up the fault-plane, overriding the underlying block 
(Fig. 3.37b). The fault-plane solution is typified by a 
central compressional sector. The orientations of the axes 
of maximum tension and compression are the inverse of 
the case for a normal fault. The T-axis is now vertical and 
the P-axis is horizontal. When the fault-plane is inclined at 
a very flat angle, the upper block can be transported over 
large horizontal distances. This special type of overthrust 
fault is common in regions of continental collision, as for 
example in the Alpine-Himalayan mountain belts. 
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Fig. 3.38 Fault-plane 
solutions for hypothetical 
earthquakes at an ocean ridge 
and transform fault system. 
Note that the sense of 
movement on the fault is not 
given by the apparent offset 
of the ridge. The focal 
mechanisms of earthquakes 
on the transform fault reflect 
the relative motion between 
the plates. Note that in this 
and similar figures the sector 
with compressional first 
motions is shaded. 




The simplest type of strike-slip fault is the transcurrent 
fault, in which the fault-plane is steep or vertical (Fig. 
3.37c). To cause this type of fault the T- and P-axes must 
both lie in the horizontal plane. The fault-plane solution 
shows two compressional and two dilatational quadrants. 
Each side of the fault moves in opposite horizontal direc¬ 
tions. If the opposite side of the fault to an observer is 
perceived to move to the left, the fault is said to be sinis- 
tral, or left-handed; if the opposite side moves to the 
right, the fault is dextral, or right-handed. 

A variant of the strike-slip fault plays a very impor¬ 
tant role in the special context of plate tectonics. A trans¬ 
form fault allows horizontal motion of one plate relative 
to its neighbor. It joins segments of a spreading ocean 
ridge, or segments of a subducting plate margin. It is 
important in plate tectonics because it constitutes a con¬ 
servative plate margin at which the tectonic plates are 
neither formed nor destroyed. The relative motion on the 
transform fault therefore reveals the direction of motion 
on adjacent segments of an active plate margin. The sense 
of motion is revealed by the pattern of compressional and 
dilatational sectors on the fault-plane solution. 

3.5.44 Focal mechanisms at active plate margins 

Some of the most impressive examples of focal mechanism 
solutions have been obtained from active plate margins. 
The results fully confirm the expectations of plate tectonic 
theory and give important evidence for the directions of 
plate motions. We can first ask what types of focal mecha¬ 
nism should be observed at each of the three types of 
active plate margin. In the theory of plate tectonics these 
are the constructive (or “accreting”), conservative (or 
“transform”) and destructive (or “consuming”) margins. 

Oceanic spreading systems consist of both constructive 
and conservative margins. The seismicity at these plate 
margins forms narrow belts on the surface of the globe. 
The focal depths are predominantly shallow, generally less 
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Fig. 3.39 Fault-plane solutions for earthquakes along the Mid-Atlantic 
Ridge, showing the prevalence of extensional tectonics with normal 
faulting in the axial zone of the spreading center (based on data from 
Huang etal., 1986). 

than 10 km below the ocean bottom. Active ridge segments 
are separated by transform faults (Fig. 3.38). New oceanic 
lithosphere is generated at the spreading oceanic ridges; 
the separation of the plates at the spreading center is 
accompanied by extension. The plates appear to be pulled 
apart by the plate tectonic forces. The extensional nature of 
the ridge tectonics is documented by fault-plane solutions 
indicative of normal faulting, as is seen for some selected 
earthquakes along the Mid-Atlantic Ridge (Fig. 3.39). In 









156 Seismology and the internal structure of the Earth 


Fig. 3.40 Fault-plane 
solutions for earthquakes on 
the St. Paul, Romanche and 
Chain transform faults in the 
Central Atlantic ocean (after 
Engeln efa/., 1986). Most 
focal mechanisms show right 
lateral (dextral) motions on 
these faults, corresponding to 
the relative motion between 
the African and American 
plates. 
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each case the fault-plane is oriented parallel to the strike of 
the ridge. On a ridge segment that is nearly normal to the 
nearest transform faults the focal mechanism solution is 
symmetric, with shaded compressional quadrants at the 
margins of the stereogram. Note that where the ridge is 
inclined to the strike of the transform fault the focal mech¬ 
anism solution is not symmetric. This means that the plates 
are not being pulled apart perpendicular to the ridge. The 
fault-plane is still parallel to the strike of the ridge, but the 
slip-vector is oblique; the plate motion has a component 
perpendicular to the ridge and a component parallel to the 
ridge. We can understand why the direction of plate 
motion is not determined by the strike of the ridge axis by 
examining the motion on the adjacent transform faults. 

The special class of strike-slip fault that joins active 
segments of a ridge or subduction zone is called a trans¬ 
form fault because it marks a conservative plate margin, 
where the lithospheric plates are being neither newly gen¬ 
erated nor destroyed. The adjacent plates move past each 
other on the active fault. Relative plate motion is present 
only between the ridge segments. Almost the entire seis¬ 
micity on the transform fault is concentrated in this 
region. On the parts of the fracture zone outside the 
segment of active faulting the plates move parallel to each 
other and there is little or no seismicity. 

Because the relative motion is horizontal, the fault- 
plate solution is typical of a strike-slip fault. However, if 
the visible offset of the ridge segments were used to inter¬ 
pret the sense of motion on this fault, the wrong conclu¬ 
sion would be drawn. The conventional interpretation of 
this class of faults as a transcurrent fault was an early 
stumbling block to the development of plate tectonic 
theory. As indicated by arrows on the focal mechanism 
diagrams (Fig. 3.38), the relative motion on a transform 
fault is opposite to what one would expect for a transcur¬ 
rent fault. It is determined by the opposite motions of the 
adjacent plates and not by the offset of the ridge seg¬ 
ments. Hence, the focal mechanisms for a number of 
earthquakes on transform faults at the Mid-Atlantic 
Ridge in the central Atlantic reflect the eastwards motion 
of the African plate and westwards motion of the 
American plate (Fig. 3.40). 


If we ignore changes in plate motion (which can, 
however, occur sporadically on some ridge systems), the 
offset of the neighboring ridge segments is a permanent 
feature that reflects how the plates first split apart. The 
orientations of the transform faults are very important, 
because the plates must move parallel to these faults. 
Thus the transform faults provide the key to determining 
the directions of plate motion. Where a ridge axis is not 
perpendicular to the transform fault the plate motion will 
have a component parallel to the ridge segment, which 
gives an oblique focal mechanism on the ridge. 

A destructive (or consuming) plate margin is marked 
by a subduction zone, where a plate of oceanic lithosphere 
is destroyed by plunging under another plate of oceanic or 
continental lithosphere. Because this is a margin of con¬ 
vergence of the adjacent plates, the earthquake fault- 
plane solutions are typical of a compressional regime 
(Fig. 3.38). The regions of compressional first motion are 
in the center of the stereogram, indicating reverse faulting; 
the P-axes of maximum compressive stress are perpendic¬ 
ular to the strike of the subduction zone. 

The type of focal mechanism observed at a subduction 
zone is dependent on the focal depth of the earthquake. 
This is because the state of stress varies within the subduct¬ 
ing plate. At first the overriding plate is thrust at a shallow 
angle over the subducting plate. In the seismic zone at the 
contact between the two plates earthquake focal mecha¬ 
nisms are typical of low-angle reverse faulting. The focal 
mechanisms of earthquakes along the west coast of 
Mexico illustrate the first of these characteristics (Fig. 
3.41). The selected earthquakes have large magnitudes 
(6.9 < M s £ 7.8 ) and shallow focal depths. The strikes of the 
fault-planes follow the trend of the oceanic trench along 
the Mexican coastline. The focal mechanisms have a central 
sector of compressional first motions and the fault-plane is 
at a low angle to the northeast, typical of overthrusting tec¬ 
tonics. The seismicity pattern documents the subduction of 
the Cocos plate under the North American plate. 

As a result of the interplate collision the subducting 
oceanic plate is bent downwards and its state of stress 
changes. Deeper than 60-70 km the seismicity does not 
arise from the contact between the converging plates. It is 
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Fig. 3.41 Fault-plane 
solutions for selected large 
shallow earthquakes in the 
subduction zone along the 
west coast of Mexico (after 
Singh efa/., 1984). The focal 
mechanisms indicate low- 
angle overthrusting, as the 
Cocos plate is subducted to 
the northeast under Mexico. 
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Fig. 3.42 Fault-plane 
solutions of earthquakes 
along the arcuate Flimalayan 
mountain belt show normal 
faulting on north-south 
oriented fault planes in 
southern Tibet, and mainly 
low-angle thrust faults along 
the Lesser Himalaya mountain 
chain (after Ni and Barazangi, 
1984). 



ITS = Indus-Tsangpo Suture ▲ 

MCT = Main Central Thrust ? sli P Action on low-angle thrust fault 
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caused by the stress pattern within the subducted plate 
itself. The focal mechanisms of some intermediate-depth 
earthquakes (70-300 km) show down-dip extension (i.e., 
the T-axes are parallel to the surface of the dipping slab) 
but some show down-dip compression (i.e., the P-axes are 
parallel to the dip of the slab). At great depths in most 
Wadati-Benioff zones the focal mechanisms indicate 
down-dip compression. 

3.54.5 Focal mechanisms in continental collisional zones 

When the continental portions of converging plates 
collide, they resist subduction into the denser mantle. The 
deformational forces are predominantly horizontal and 
lead to the formation of folded mountain ranges. The 
associated seismicity tends to be diffuse, spread out over a 
large geographic area. The focal mechanisms of earth¬ 


quakes in the folded mountain belt reflect the ongoing col¬ 
lision. The collision of the northward-moving Indian 
plate with the Eurasian plate in the Late Tertiary led to the 
formation of the Himalayas. Focal mechanisms of earth¬ 
quakes along the arcuate mountain belt show that the 
present style of deformation consists of two types (Fig. 
3.42). Two fault-plane solutions south of the main moun¬ 
tain belt correspond to an extensional regime with normal 
faulting. To the north, in southern Tibet, the fault-plane 
solutions also show normal faulting on north-south ori¬ 
ented fault planes. Beneath the Lesser Himalaya seismic 
events distributed along the entire 1800 km length of the 
mountain chain indicate a deformational regime with 
some strike-slip faulting but predominantly low-angle 
thrust faulting. The Indian continental crust appears to be 
thrusting at a shallow angle under the Tibetan continental 
crust. This causes crustal thickening toward the north. In 
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Fig. 3.43 Fault-plane 
solutions for earthquakes in 
or near the Swiss Alps in 
central Europe. The arrows 
show the horizontal 
components of the 
interpreted axes of maximum 
compression. They are 
approximately at right angles 
to the strike of the mountain 
ranges of the Jura and Alps 
(after Mayer Rosa and Muller, 
1979, and Pavoni, 1977). 
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the main chain of the Lesser Himalaya the minimum com¬ 
pressive stress is vertical. The increased vertical load due 
to crustal thickening causes the directions of the principal 
stresses to change, so that under southern Tibet the 
maximum compressive stress is vertical. 

A different type of collisional tectonics is shown by 
fault-plane solutions from the Alpine mountain belt 
in south-central Europe. The Alps were formed during 
the collision between the African and European plates, 
starting in the Early Tertiary. The focal mechanisms of 
many, mostly small earthquakes show that they are pre¬ 
dominantly associated with strike-slip faults (Fig. 3.43). 
The horizontal projections of the compressional axes are 
oriented almost perpendicular to the strike of the Alps 
and rotate along the arc. The fault-plane solutions for the 
modern seismicity indicate that the Alpine fold-belt is a 
region of continuing interplate collision. 

3.5.5 Secondary effects of earthquakes: landslides, tsunami, 
fires and fatalities 

Before discussing methods of estimating the size of earth¬ 
quakes, it is worth considering some secondary effects that 
can accompany large earthquakes: landslides, seismic sea 
waves and conflagrations. These rather exceptional effects 
cannot be included conveniently in the definition of earth¬ 
quake intensity, because their consequences cannot be 
easily generalized or quantified. For example, once a 
major fire has been initiated, other factors not related 
directly to the size of the earthquake (such as aridity of 
foliage, combustibility of building materials, availability 
and efficiency of fire-fighting equipment) determine how 
it is extinguished. 

A major hazard associated with large earthquakes in 
mountainous areas is the activation of landslides, which 
can cause destruction far from the epicenter. In 1970, an 


earthquake in Peru with magnitude 7.8 and shallow focal 
depth of 40 km caused widespread damage and a total 
death toll of about 66,000. High in the Cordillera Blanca 
mountains above the town of Yungay, about 15 km away, 
an enormous slide of rock and ice was released by the 
tremors. Geologists later speculated that a kind of “air 
cushion'’ had been trapped under the mass of rock and 
mud, enabling it to acquire an estimated speed of 300-400 
km h -1 , so that it reached Yungay less than five minutes 
later. Over 90% of the town was buried under the mud and 
rock, in places to a depth of 14 m, and 20,000 lives were lost. 

When a large earthquake occurs under the ocean, it can 
activate a seismic sea wave known as a tsunami, which in 
Japanese means “harbor wave.” This special type of sea 
wave (Box 3.3) is a long-wavelength disturbance of the 
ocean surface that can be triggered by large-scale collapse 
or uplift of part of the ocean floor, by an underwater land¬ 
slide, or by submarine volcanism. The entire water column 
must be displaced to set off a tsunami, so only earthquakes 
with magnitude larger than about 7.5 are capable of doing 
so. The magnitude 9.0 Sumatra-Andaman Islands earth¬ 
quake on December 26, 2004, had a rupture zone 1300 km 
in length and 150 km in width, with displacements on the 
fault of 20 m (Lay et al., 2005). This probably resulted in 
uplift of the ocean bottom by several meters, which initi¬ 
ated a disastrous tsunami in the Indian Ocean. 

A tsunami propagates throughout an ocean basin as a 
wave with period T of around 15-30 min. The entire 
water column participates in the motion. As a result, the 
velocity of the wave, v, is dependent on the water depth, d, 
and the acceleration due to gravity, g, and is given by: 

v = <gd (3.89) 

Over an ocean basin with water depth greater than 
4km, the tsunami velocity is higher than 200 m s -1 
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Box 3.3: Tsunami 


When the ocean floor is abruptly lifted or dropped by a 
submarine earthquake or landslide, the entire water 
column is pushed up and down. More than 95% of the 
potential energy of the displaced water is gravitational 
and less than 5% is elastic energy resulting from com¬ 
pression of the ocean floor or water column. The poten¬ 
tial energy of the vertical motion is converted to kinetic 
energy and propagates away from the source as a 
tsunami. The propagation of ocean waves is a complex 
non-linear problem that cannot be handled here, but 
approximate solutions are useful for the special case of 
a tsunami. 

Ocean surface waves with periods shorter than 50 s are 
confined to the top kilometer of the ocean and cannot be 
excited by motions of the sea floor. The propagation of 
ocean waves is in general dispersive, i.e., the wave velocity 
depends on its wavelength and period (see Section 
3.3.3.3). Figure B3.3 shows the computed wave velocities 
for different periods in water of different depths. The 
wave speeds are dispersive for periods shorter than about 
300 s (5 minutes). These waves have very long wave¬ 
lengths, much greater than the ocean depth. 

For a wave with period T (angular velocity w = 2ttIT) 
and wavelength A (wave number fc = 2-7r/A, the disper¬ 
sion in an ocean of depth d is governed by the relation 
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Fig. B3.3 Dependence of the phase-velocity (c) and group-velocity 
( U) of a tsunami on the period of the wave for different ocean depths 
(source: S. N. Ward, personal communication, 2006). 

c = j=^gd (4) 

The group velocity ( U) is the propagation speed of the 
envelope of a wave packet, and thus is the speed with 
which the energy of the tsunami travels. Using Eq. (3) 
and Eq. (3.82), the group velocity for a tsunami is given 
by 


a) 2 = gktanh(kd) 


( 1 ) 



igd 


(5) 


where g is the acceleration of gravity and tanh(x) is the 
hyperbolic tangent function of x: 


tanh(x) = + g 


lx, if x small 
l 1, if x large 


( 2 ) 


If the wavelength A is very much greater than the 
ocean depth, tanh(M) can be replaced in Eq. (1) by ( kd ) 
and the dispersion relation becomes 


w 2 = gk(kd) = k 2 gd 

o^k^gd (3) 


Analogously to surface waves, a tsunami travels 
across an ocean as a packet of waves of different period. 
The phase velocity (c) of a wave is the speed of an indi¬ 
vidual phase in the packet. Using Eq. (3) and Eq. (3.62), 
the phase velocity for a tsunami is given by 


For water depths that are much less than the wave¬ 
length the phase and group velocities are equal and the 
propagation of a tsunami is non-dispersive. These rela¬ 
tions are valid for wavelengths greater than about 11 
times the water depth. Over an ocean basin that is 4 km 
deep the propagation velocity is 200 m s' 1 (720 km h -1 ) 
and the wavelengths for periods of 1000-2000 s (around 
15-30 min) are 200M00 km. When the tsunami 
approaches shore and the water depth decreases, its 
velocity slows. The advancing wave loses kinetic energy, 
part of which is lost as friction with the ocean bottom 
and part converted into potential energy. This is evident 
as in increase in wave height, from a few tens of centime¬ 
ters over the open ocean to many meters close to land. 
The contact with land is less often as a ferocious break¬ 
ing wave than as a rapid increase in sea-level - as during 
high tides - accompanied by fierce currents which may 
sweep far inland and withdraw in the same way. 


(720km h -1 , 450 m.p.h), and the wavelength (equal to the 
product vT) may measure 200 km (Box 3.3). The amplitude 
of a tsunami over the open ocean is comparatively small; 
the Sumatra tsunami measured about 80-100 cm from 
crest to trough in the open Indian Ocean. Despite the 
speed of the wave, an observer on a ship would be scarcely 
aware of the passage of such a low-amplitude, long- 


wavelength disturbance. However, on approaching shal¬ 
lower water the leading part of the tsunami slows down 
and tends to be overridden by the following water mass, so 
that the height of the wave increases. The wave height may 
be amplified by the shapes of the sea-bottom and the 
coastline to several meters. In 1896 a large earthquake 
raised the ocean-bottom off the southern shore of Japan 
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Fig. 3.44 Propagation of 
tsunami waves across the 
Pacific Ocean following the 
1960 Chilean earthquake (M w 
= 9.5) and the 1964 Alaskan 
earthquake (M w = 9.2). 
Numbers on the wavefronts 
show the travel-times in 
hours. 



and initiated a tsunami that raced ashore with an estimated 
wave height of more than 20 m, causing 26,000 fatalities. 
One of the best-studied tsunami was set off by a great 
earthquake in the Aleutian islands in 1946. It travelled 
across the Pacific and several hours later reached Hilo, 
Hawaii, where it swept ashore and up river estuaries as a 
wave 7 m high. A consequence of the devastation by this 
tsunami around the Pacific basin was the formation of the 
Tsunami Warning System. When a major earthquake is 
detected that can produce a tsunami, a warning is issued to 
alert endangered regions to the imminent threat. The 
system works well far from the source. It may take several 
hours for the tsunami to arrive at a distant location, as 
illustrated by records of tsunami propagation for the 1960 
Chilean and 1964 Alaskan earthquakes (Fig. 3.44). This 
allows time to warn and evacuate the population in many 
places far from the epicenter. However, tsunami casualties 
may still occur near to the epicenter of the earthquake. 

The tsunami warning system is currently effective only 
in the Pacific Ocean. The 2004 Sumatra earthquake trig¬ 
gered the worst tsunami in recorded history with proba¬ 
bly more than 250,000 fatalities in Indonesia, Thailand, 
Sri Lanka and other countries bordering the Indian 
Ocean as far away as Somalia. As a consequence of this 
disaster tsunami warning systems are planned for the 
Indian and other marine basins. 

In addition to causing direct damage to man-made 
structures, an earthquake can disrupt subterranean supply 
routes (e.g., telephone, electrical and gas lines) which in 
turn increases the danger of explosion and fire. Aqueducts 
and underground water pipelines may be broken, with 
serious consequences for the inhibition or suppression of 
fires. The San Francisco earthquake of 1906 was very 
powerful. The initial shock caused widespread damage, 
including the disruption of water supply lines. But a great 
fire followed the earthquake, and because the water supply 


lines were broken by the tremor, it could not be extin¬ 
guished. The greatest damage in San Francisco resulted 
from this conflagration. 

3.5.6 Earthquake size 

There are two methods of describing how large an earth¬ 
quake is. The intensity of the earthquake is a subjective 
parameter that is based on an assessment of visible effects. 
It therefore depends on factors other than the actual size 
of the earthquake. The magnitude of an earthquake is 
determined instrumentally and is a more objective 
measure of its size, but it says little directly about the seri¬ 
ousness of the ensuing effects. Illogically, it is usually the 
magnitude that is reported in news coverage of a major 
earthquake, whereas the intensity is a more appropriate 
parameter for describing the severity of its effects on 
mankind and the environment. 

3.5.6.1 Earthquake intensity 

Large earthquakes produce alterations to the Earth’s 
natural surface features, or severe damage to man-made 
structures such as buildings, bridges and dams. Even 
small earthquakes can result in disproportionate damage 
to these edifices when inferior constructional methods or 
materials have been utilized. The intensity of an earth¬ 
quake at a particular place is classified on the basis of the 
local character of the visible effects it produces. It 
depends very much on the acuity of the observer, and is in 
principle subjective. Yet, intensity estimates have proved 
to be a viable method of assessing earthquake size, 
including historical earthquakes. 

The first attempt to grade earthquake severity was made 
in the late eighteenth century by Domenico Pignataro, an 
Italian physician, who classified more than 1000 earth- 
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Table 3.1 Abridged and simplified version of the European Macroseismic Scale 1998 (European Seismological 
Commission, 1998) for earthquake intensity 

The scale focuses especially on the effects on people and buildings. It takes into account classifications of both the 
vulnerability of a structure (i.e., the materials and method of construction) and the degree of damage. 


Intensity Description of effects 


I—IV light to moderate earthquakes 

I Not felt 

II Scarcely felt Felt only by a few individual people at rest in houses. 

III Weak Felt indoors by a few people. People at rest feel a swaying or light trembling. 

IV Largely observed Felt indoors by many people; outdoors by very few. A few people are awakened. Windows, doors 
and dishes rattle. 

V—VIII moderate to severe earthquakes 

V Strong Felt indoors by most, outdoors by few. Many sleeping people awake. A few are frightened. Buildings 
tremble throughout. Hanging objects swing considerably. Small objects are shifted. Doors and windows swing 
open or shut. 

VI Slightly damaging Many people are frightened and run outdoors. Some objects fall. Many houses suffer slight 
non-structural damage like hair-line cracks and fall of small pieces of plaster. 

VII Damaging Most people are frightened and run outdoors. Furniture is shifted and objects fall from shelves in large 
numbers. Many well built ordinary buildings suffer moderate damage: small cracks in walls, fall of plaster, parts of 
chimneys fall down; older buildings may show large cracks in walls and failure of fill-in walls. 

VIII Heavily damaging Many people find it difficult to stand. Many houses have large cracks in walls. A few well built 
ordinary buildings show serious failure of walls, while weak older structures may collapse. 

IX-XII severe to destructive earthquakes 

IX Destructive General panic. Many weak constructions collapse. Even well built ordinary buildings show very heavy 
damage: serious failure of walls and partial structural failure. 

X Very destructive Many ordinary well built buildings collapse. 

XI Devastating Most ordinary well built buildings collapse, even some with good earthquake resistant design are 
destroyed. 

XII Completely devastating Almost all buildings are destroyed. 


quakes that devastated the southern Italian province of 
Calabria in the years 1783-1786. His crude analysis classi¬ 
fied the earthquakes according to whether they were very 
strong, strong, moderate or slight. In the mid-nineteenth 
century an Irish engineer, Robert Mallet, produced a list of 
6831 earthquakes and plotted their estimated locations, 
producing the first map of the world’s seismicity and estab¬ 
lishing that earthquakes occurred in distinct zones. He also 
used a four-stage intensity scale to grade earthquake 
damage, and constructed the first isoseismal maps with 
lines that outlined areas with broadly equal grades of 
damage. The Rossi-Forel intensity scale, developed in the 
late nineteenth century by the Italian scientist M. S. de 
Rossi and the Swiss scientist F. Forel, incorporated ten 
stages describing effects of increasing damage. In 1902 an 
Italian seismologist, G. Mercalli, proposed a still more 
extensive, expanded intensity scale which reclassified earth¬ 
quake severity in twelve stages. A variation, the Modified 
Mercalli (MM) scale, was developed in 1931 to suit build¬ 
ing conditions in the United States, where a later modifica¬ 
tion is in common use. The Medvedev-Sponheuer-Karnik 
(MSK) scale, introduced in Europe in 1964, and modified 
in 1981, also has twelve stages and differs from the MM 
scale mainly in details. A new European Macroseismic 
Scale (EMS-98) was adopted in 1998; an abridged version 


is shown in Table 3.1. The new 12-stage EMS scale is based 
on the MSK scale but takes into account the vulnerability 
of buildings to earthquake damage and incorporates more 
rigorous evaluation of the extent of damage to structures 
with different building standards. 

In order to evaluate the active seismicity of a region 
questionnaires may be distributed to the population, 
asking for observations that can be used to estimate the 
intensity experienced. The questionnaires are evaluated 
with the aid of an intensity scale, and the intensity 
recorded at the location of each observer is plotted on a 
map. Continuous lines are then drawn to outline places 
with the same intensity (Fig. 3.45), in the same way that 
contour lines are used on topographic maps to show eleva¬ 
tion. Comparison of the isoseismal maps with geological 
maps helps explain the response of the ground to the shake 
of an earthquake. This is valuable information for under¬ 
standing earthquake risk. The foundation on which struc¬ 
tures are erected plays a vital role in their survival of an 
earthquake. For example, soft sediments can amplify the 
ground motion, enhancing the damage caused. This is even 
more serious when the sediments have a high water 
content, in which case liquefaction of the sediments can 
occur, robbing structures built on them of support and 
promoting their collapse. 
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Fig. 3.45 Isoseismal map with contours of equal intensity for the New 
Madrid, Missouri, earthquake of 1811 (after Nuttli, 1973). 


There are numerous examples of this having occurred. 
In the great Alaskan earthquake in 1964 a section of 
Anchorage that was built on a headland underlain by a wet 
clay substratum collapsed and slid downslope to the sea. In 
1985 a very large earthquake with magnitude 8.1 struck the 
Pacific coast of Mexico. About 350 km away in Mexico 
City, despite the large distance from the epicenter, the 
damage to buildings erected on the alluvium of a drained 
lakebed was very severe while buildings set on a hard rock 
foundation on the surrounding hills suffered only minor 
damage. In the Loma Prieta earthquake of 1989 severe 
damage was caused to houses built on landfill in San 
Francisco’s Mission district, and an overhead freeway built 
on pillars on young alluvium north of Oakland collapsed 
dramatically. Both regions of destruction were more than 
70 km from the epicenter in the Santa Cruz mountains. 
Similarly, in the San Francisco earthquake of 1906 worse 
damage occurred to structures built on landfill areas 
around the shore of the bay than to those with hard rock 
foundations in the hills of the San Francisco peninsula. 

Intensity data play an important role in determining 
the historic seismicity of a region. An earthquake has dra¬ 
matic consequences for a population; this was especially 
the case in the historic past, when real hazards were aug¬ 
mented by superstition. The date (and even the time) of 
occurrence of strong earthquakes and observations of 
their local effects have been recorded for centuries in 
church and civil documents. From such records it is some¬ 
times possible to extract enough information for a given 


earthquake to estimate the intensity experienced by the 
observer. If the population density is high enough, it may 
be possible to construct an isoseismal map from which the 
epicenter of the tremor may be roughly located. An inter¬ 
esting example of this kind of analysis is the study of the 
New Madrid earthquakes of 1811-1812, which caused 
devastation in the Mississippi valley and were felt as far 
away as the coastlines along the Atlantic and Gulf of 
Mexico (Fig. 3.45). There were probably three large earth¬ 
quakes, but the events occurred before the invention of the 
seismograph so details of what happened are dependent 
on the subjective reports of observers. Historical records 
of the era allow development of an intensity map for the 
settled area east of the Mississippi, but the pioneering 
population west of the river was at that time too sparse to 
leave adequate records for intensity interpretation. On the 
basis of the available evidence these earthquakes are esti¬ 
mated to have had magnitudes of 7.8-8.1. 

Earthquake intensity data are valuable for the con¬ 
struction of seismic risk maps, which portray graphically 
the estimated earthquake hazard of a region or country. 
The preparation of a seismic risk map is a lengthy and 
involved task, which typically combines a study of the 
present seismicity of a region with analysis of its historic 
seismicity. A map of the maximum accelerations experi¬ 
enced in recent seismicity helps to identify the areas that 
are most likely to suffer severe damage in a large earth¬ 
quake. The likelihood of an earthquake happening in a 
given interval of time must also be taken into account. 
Even in a region where large earthquakes occur, they 
happen at irregular intervals, and so knowledge of local 
and regional earthquake frequency is important in risk 
estimation. A seismic risk map of the United States (Fig. 
3.46) shows the peak accelerations that are likely to be 
experienced at least once in a 50-year period. 

Devastating earthquakes can occur even in countries 
with relatively low seismic risk. Switzerland, in the 
center of Europe, is not regarded as a country prone to 
earthquakes. The seismic activity consists mostly of small 
earthquakes, mainly in the Alps in the collision zone of the 
African and European plates. Yet, in 1356 the northern 
Swiss town of Basel was destroyed by a major earthquake. 
Seismic risk maps are useful in planning safe sites for 
important edifices like nuclear power plants or high dams 
for hydroelectric power, which supply a substantial propor¬ 
tion of Switzerland’s energy needs. Risk maps are also 
valuable to insurance companies, which must know the 
seismic risk of a region in order to assess the costs of earth¬ 
quake insurance coverage for private and public buildings. 

3.5.62 Earthquake magnitude 

Magnitude is an experimentally determined measure of 
the size of an earthquake. In 1935 C. F. Richter attempted 
to grade the sizes of local earthquakes in Southern 
California on the basis of the amplitude of the ground 
vibrations they produced at a known distance from the 
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Fig. 3.46 Seismic risk map of 
the United States. The 
numbers on contour lines give 
the maximum acceleration (in 
percent of g) that might be 
expected to be exceeded with 
a probability of 1 in 10 during 
a 50-year period (after Bolt, 
1988). 



epicenter. The vibrations were recorded by seismographs, 
which were standardized to have the same response to a 
given stimulus. Richter’s original definition of magnitude 
was based on surface-wave amplitudes (A s ) recorded by 
seismographs at an epicentral distance of 100 km. 
Because seismographs were located at various distances 
from the earthquake, an extra term was added to compen¬ 
sate for attenuation of the signal with increasing epicen¬ 
tral distance. Increasingly sensitive instruments allowed 
the recording of signals from distant earthquakes; those 
from events with epicentral distances greater than 20° are 
known as teleseismic signals. Originally, the magnitude 
was determined from the horizontal ground motion, 
because seismological stations were equipped mainly with 
horizontal-motion seismometers. However, the surface 
waves recorded by these instruments consist of super¬ 
posed Love and Rayleigh waves, which complicates the 
theoretical interpretation of the records. Vertical-motion 
seismometers record only the Rayleigh waves (together 
with P- and SV-waves), and so progressively the definition 
of surface-wave magnitude has come to be based on 
the vertical component of motion. The majority of 
surface-wave magnitudes assigned to earthquakes world¬ 
wide are now based on vertical-motion records. 

The International Association for Seismology and 
Physics of the Earth’s Interior (IASPEI) has adopted the 
following definition of the surface-wave magnitude (M s ) 
of an earthquake: 

M s = log 10 (^) + 1.66 log I0 ( A) + 3.3 (3 ’ 90) 

where A s is the vertical component of the ground motion 
in micrometers (|xm) determined from the maximum 
Rayleigh-wave amplitude, T is the period of the wave 
(18-22 seconds), A is the epicentral distance in degrees 
(20° < A< 160°), and where the earthquake has a focal 
depth of less than 50 km. A similar equation to Eq. (3.90) 


applies to broadband recordings; in this case {AJT) max 
corresponds to the maximum ground velocity. The 
surface-wave magnitudes of some important historical 
earthquakes are given in Table 3.2. 

The depth of the source affects the nature of the 
seismic wave train, even when the same energy is released. 
An earthquake with a deep focus may generate only a 
small surface-wave train, while shallow earthquakes 
cause very strong surface waves. Equation (3.90) for M s 
was derived from the study of shallow earthquakes, 
observed at a distance greater than 20°. Therefore, correc¬ 
tions must be made to the computed value of M s to com¬ 
pensate for the effects of a focal depth greater than 50 km 
or epicentral distance less than 20°. 

The amplitude of body waves is not sensitive to the 
focal depth. As a result, earthquake magnitude scales 
have also been developed for use with body waves. An 
equation, proposed by B. Gutenberg in 1945, can be used 
to calculate a body-wave magnitude (m b ) from the 
maximum amplitude ( A p ) of the ground motion associ¬ 
ated with P-waves having a period ( T ) of less than 3 s: 

m h = logi 0 (^| + Q(A,/j) (3 ' 91 ) 

Where Q(A, /;) is an empirical correction for signal attenu¬ 
ation due to epicentral distance (A) and focal depth (h) 
that is made by reading directly from a graph or table of 
values. 

For some earthquakes both M s and m b can be calculated. 
Unfortunately, the different estimates of magnitude often 
do not agree well, except for small earthquakes. This is due 
to the way the ground responds to a seismic event, and to the 
different natures of body waves and surface waves. Body 
waves have a different dependence of amplitude on fre¬ 
quency than do surface waves. m b is estimated from a high- 
frequency (1 Hz) phase whereas M is determined from 
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Table 3.2 Some important historical earthquakes, with their surface- 
the numbers of fatalities 

■wave 

magnitudes M v , moment magnitudes M w , and 

Year 

Epicenter 

M s 

Magnitude 

K 

Fatalities 

1906 

San Francisco, California 

8.3 

7.8 

3,000 

1908 

Messina, Italy 

7.2 

— 

70,000 

1923 

Kanto, Japan 

8.2 

7.9 

143,000 

1952 

Kamchatka, Russia 

8.2 

9.0 


1957 

Andreanof Islands, Alaska 

8.1 

8.6 


1960 

Valdivia, Chile 

8.5 

9.5 

5,700 

1960 

Agadir, Morocco 

5.9 

5.7 

10,000 

1964 

Prince William Sound, Alaska 

8.6 

9.2 

125 

1970 

Chimbote, Peru 

7.8 

7.9 

66,000 

1971 

San Fernando, California 

6.5 

6.7 

65 

1975 

Haicheng, China 

7.4 

7.0 

—300 

1976 

Tangshan, China 

7.8 

7.5 

243,000 

1980 

El Asnam, Algeria 

7.3 

— 

2,590 

1985 

Michoacan, Mexico 

8.1 

8.0 

9,500 

1989 

Loma Prieta, California 

7.1 

6.9 

62 

1994 

Northridge, California 

6.8 

6.7 

60 

1999 

Ismit, Turkey 

— 

7.6 

17,100 

2004 

Sumatra-Andaman Islands 

— 

9.0 

250,000 


low-frequency (0.05 Hz) vibrations. Above a certain size, 
each method becomes insensitive to the size of the earth¬ 
quake, and exhibits magnitude saturation. This occurs for 
body-wave estimates at around m h = 6; all larger earth¬ 
quakes give the same body-wave magnitude. Similarly, 
surface-wave estimates of magnitude saturate at M s = 8. 
Thus, for very large earthquakes, M s and m h underestimate 
the energy released. An alternative definition of magnitude, 
based upon the long-period spectrum of the seismic wave, is 
preferred for very large earthquakes. It makes use of the 
physical dimensions of the focus. 

As discussed in the elastic rebound model (Section 3.1), 
a tectonic earthquake arises from abrupt displacement of 
a segment of a fault. The area 5 of the fractured segment 
and the amount by which it slipped D can be inferred. 
Together with the rigidity modulus yi of the rocks adjacent 
to the fault, these quantities define the seismic moment M 0 
of the earthquake. Assuming that the displacement and 
rigidity are constant over the area of the rupture: 

M 0 = /a5D (3.92) 

The seismic moment can be used to define a moment 
magnitude (M w ). The definition adopted by the responsi¬ 
ble commission of IASPEI is: 

M w = |(logi 0 M 0 -9.1) (3.93) 

In this equation M 0 is in N m. If c.g.s. units are used 
instead of SI units, M 0 is expressed in dyne cm and the 
corresponding equation is: 

M w = |(log 10 M 0 -16.1) (3.94) 


M w is more appropriate for describing the magnitudes 
of very large earthquakes. It has largely replaced M s in 
scientific evaluation of earthquake size, although M s is 
often quoted in reports in the media. The moment magni¬ 
tudes and surface-wave magnitudes of some historical 
earthquakes are listed in Table 3.2. 

The magnitude scale is, in principle, open ended. 
Negative Richter magnitudes are possible, but the limit of 
sensitivity of seismographs is around —2. The maximum 
possible magnitude is limited by the shear strength of the 
crust and upper mantle, and since the beginning of instru¬ 
mental recording none has been observed with a surface- 
wave magnitude M s as high as 9. However, this is largely 
due to a saturation effect resulting from the method by 
which surface-wave magnitudes M s are computed. 
Seismic moment magnitudes M w of 9 and larger have 
been computed for some giant earthquakes (Table 3.2). 
The largest in recorded history was the 1960 Chile earth¬ 
quake with M w = 9.5. 

When an earthquake occurs on a fault, the ruptured 
area spreads in size from the point of initial failure, akin 
to opening a zip fastener. If the fault ruptures along a 
length L , and the fractured segment has a down-dip 
dimension w (referred to as the width of the faulted zone), 
the area S is equal to wL. Assuming that the aspect ratio of 
faults is constant (i.e., w is proportional to L) then the rup¬ 
tured area S' is proportional to L 2 . This is a generalization, 
because different faults have different aspect ratios. 
Similarly, if the stress drop in an earthquake is constant, 
the displacement on the fault, D , can be assumed propor¬ 
tional to L. Together, these assumptions imply that the 
seismic moment M 0 scales as ZA Assuming that S °c L 2 , the 
seismic moment scales as S 3/2 . This inference is supported 
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Fig. 3.47 Correlation of the ruptured area (S, in km 2 ) with the seismic 
moment (M 0 , in N m) and moment magnitude (MJ for some shallow 
earthquakes (after Kanamori and Brodsky, 2004). 


by the correlation between seismic moment and the rup¬ 
tured area S for many shallow earthquakes (Fig. 3.47). 

The length L of the rupture zone determines how long 
the ground motion lasts in an earthquake. This factor has 
a significant bearing on the extent of damage to struc¬ 
tures during moderate to strong earthquakes. Theory 
gives the speed of propagation of the rupture as 75-95% 
of the shear wave velocity. Assuming that the rupture 
propagates along the fault at about 2.5-3.0 km s -1 , the 
duration of the ground motion can be estimated for an 
earthquake of a given magnitude or seismic moment (Fig. 
3.48). For example, close to the epicenter of a magnitude 
5 earthquake the vibration may last a few seconds, 
whereas in a magnitude 8 earthquake it might last about 
50 s. Flowever, there is a lot of scatter about the ideal cor¬ 
relation. In the great Sumatra earthquake, with magni¬ 
tude 9.0, the rupture continued for some 500 s. 

The seismic moment M 0 of an earthquake is deter¬ 
mined in practice from the analysis of seismograms at a 
large number of seismic stations (currently 137) distributed 
world-wide and linked to form the Global Seismographic 
Network (GSN). The waveform of each seismogram is a 
product of the epicentral distance, focal depth and radia¬ 
tion pattern of the earthquake (Section 3.5.4.1). It is also 
affected by source parameters such as the area, orientation 
and amount of slip of the ruptured segment, factors which 
determine the seismic moment M 0 . Inversion of the seis¬ 
mogram data leads to an understanding of the earthquake 
source. At large distances and for long-period components 
of the seismogram (e.g., T>40 s) that correspond to wave¬ 
lengths much longer than the dimensions of the faulted 
area, the source can be considered as a point source. 
Assuming a radiation model for the source and a velocity 
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Fig. 3.48 Correlation of the source durations (in seconds) of shallow 
earthquakes and their seismic moments, M 0 , (in N m) and moment 
magnitudes, M w (after Kanamori and Brodsky, 2004). 

model for the propagation, a synthetic seismogram (see 
also Section 3.6.5.4) can be computed for each station and 
compared with the observed waveform. Iterative adjust¬ 
ment of the source parameters to give a best fit to the wave¬ 
forms at a number of receivers defines mathematically a 
moment tensor. The tensor can be visualized geometrically 
as an ellipsoid in which the lengths and orientations of the 
axes correspond to the moment of the earthquake and the 
directions of the tensional (T) axis and compressional (P) 
axis at the source. The location of the system of couples 
modelled by the moment tensor is the optimum point 
source location for the earthquake and is called its centroid. 
The centroid moment tensor (CMT) analyses of earth¬ 
quakes are made rapidly and give the source parameters of 
an earthquake within a few hours. The centroid location 
may differ from the hypocenter of the earthquake (the 
place where rupture first occurred). The former is based on 
analysis of the full seismogram waveform, the latter only 
on the first arrival times. So, for example, the centroid of 
the 2004 Sumatra-Andaman earthquake was located 
about 160 km west of its epicenter. 

3.5.63 Relationship between magnitude and intensity 

The intensity and magnitude scales for estimating the size 
of an earthquake are defined independently but they have 
some common features. Intensity is a measure of earth¬ 
quake size based on the extent of local damage it causes at 
the location of an observer. The definition of magnitude is 
based on the amplitude of ground motion inferred from 
the signal recorded by the observer’s seismograph, and of 
course it is the nature of the ground motion - its amplitude, 
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Table 3.3 Average number of earthquakes per year world¬ 
wide since 1990, except for M 8 which are averaged since 
1900 (based on data from the US Geological Survey 
National Earthquake Information Center). The mean 
annual release of seismic energy is estimated using the 
energy-magnitude relation in Eq. (5. 97) 


Earthquake 

magnitude 

Number 
per year 

Annual energy 
[10 15 J yr _1 ] 

>8.0 

«i 

= 100 

7-7.9 

17 

190 

6-6.9 

134 

45 

5-5.9 

1,319 

14 

4^1.9 

« 13,000 

4 

3-3.9 

« 130,000 

1 

2-2.9 

« 1,300,000 

0.4 


velocity and acceleration - which produce the local damage 
used to classify intensity. However, in the definition of 
magnitude the ground-motion amplitude is corrected for 
epicentral distance and converted to a focal characteristic. 
Isoseismal maps showing the regional distribution of 
damage give the maximum intensity (7 max ) experienced in 
an earthquake, which, although influenced by the geo¬ 
graphic patterns of population and settlement, is usually 
near to the epicenter. A moderately strong, shallow-focus 
earthquake under a heavily populated area can result in 
higher intensities than a large deep focus earthquake under 
a wilderness area (compare, for example, the death tolls for 
the 1960 Agadir magnitude 5.7 and 1964 Alaskan magni¬ 
tude 9.2 earthquakes in Table 3.2). However, for earth¬ 
quakes with focal depth /? < 50 km the dependence of 7 max 
on the focal depth can be taken into account, and it is pos¬ 
sible to relate the maximum intensity to the magnitude 
with an empirical equation (Karnik, 1969): 

4ax= 1.5M s -1.81og 10 /*+ 1.7 (3.95) 

This type of equation is useful for estimating quickly the 
probable damage that an earthquake causes. For example, 
it predicts that in the epicentral region of an earthquake 
with magnitude 5 and a shallow focal depth of 10 km, the 
maximum MSK intensity will be VII (moderately serious 
damage), whereas, if the focal depth is 100 km, a maximum 
intensity of only IV-V (minor damage) can be expected. 

3.5.7 Earthquake frequency 

Every year there are many small earthquakes, and only a 
few large ones. According to a compilation published by 
Gutenberg and Richter in 1954, the mean annual number 
of earthquakes in the years 1918-1945 with magnitudes 
4^1.9 was around 6000, while there were only an average 
of about 100 earthquakes per year with magnitudes 
6-6.9. The relationship between annual frequency (TV) 
and magnitude ( M s ) is logarithmic and is given by an 
equation of the form 



Magnitude (Ms) 



Year 


Fig. 3.49 Histograms of (a) the logarithm of the number (A/) of 
earthquakes per year with magnitude M s , and (b) the annual number 
of earthquakes with magnitudes M s >7 since 1900 (based on data 
from the US Geological Survey National Earthquake Information 
Center). 


log N = a — bM s (3.96) 

The value of a varies between about 8 and 9 from one 
region to another, while b is approximately unity for 
regional and global seismicity. The mean annual numbers 
of earthquakes in different magnitude ranges are listed in 
Table 3.3; the frequency decreases with increasing magni¬ 
tude (Fig. 3.49a), in accordance with Eq. (3.96). The 
annual number of large earthquakes with magnitude 
M s >7 in the years 1900-1989 has varied between 
extremes of about 10 and 40, but the long-term average is 
about 20 per year (Fig. 3.49b). 

3.5.8 Energy released in an earthquake 

The definition of earthquake magnitude relates it to the 
logarithm of the amplitude of a seismic disturbance. 
Noting that the energy of a wave is proportional to the 
square of its amplitude it should be no surprise that the 
magnitude is also related to the logarithm of the energy. 
Several equations have been proposed for this relation¬ 
ship. An empirical formula worked out by Gutenberg 
(1956), relates the energy release E to the surface-wave 
magnitude M s : 
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log 10 £ = 4.8 + 1.5M s (3.97) 

where E is in joules. The logarithmic nature of the formula 
means that the energy release increases very rapidly with 
increasing magnitude. For example, when the magnitudes 
of two earthquakes differ by 1, their corresponding 
energies differ by a factor 32 (= 10 1 - 5 ). Hence, a magnitude 
7 earthquake releases about 1000 (= 10 (1 - 5x2) ) times the 
energy of a magnitude 5 earthquake. Another way of 
regarding this observation is that it takes 1000 magnitude 
5 earthquakes to release the same amount of energy as a 
single large earthquake with magnitude 7. Multiplying the 
mean number of earthquakes per year by their estimated 
energy (using one of the energy-magnitude equations) 
gives an impression of the importance of very large earth¬ 
quakes. Table 3.3 shows that the earthquakes with M s >7 
are responsible for most of the annual seismic energy. In a 
year in which a very large earthquake (M s > 8) occurs, 
most of the annual seismic energy is released in that single 
event. 

It is rather difficult to appreciate the amount of energy 
released in an earthquake from the numerical magnitude 
alone. A few examples help illustrate the amounts of 
energy involved. Earthquakes with M s = 1 are so weak 
that they can only be recorded instrumentally; they are 
referred to as microearthquakes. The energy associated 
with one of these events is equivalent to the kinetic energy 
of a medium sized automobile weighing 1.5 tons which is 
travelling at 130 km h -1 (80 m.p.h.). The energy released 
by explosives provides another means of comparison, 
although the conversion of energy into heat, light and 
shock waves is proportionately different in the two phe¬ 
nomena. One ton of the explosive trinitrotoluene (TNT) 
releases about 4.2 X 10 9 joules of energy. Equation (3.97) 
shows that the 11 kiloton atomic bomb which destroyed 
Hiroshima released about the same amount of energy as 
an earthquake with magnitude 5.9. The energy released in 
a 1 megaton nuclear explosion is equivalent to an earth¬ 
quake with magnitude 7.2; the 2004 Sumatra earthquake 
(magnitude 9.0) released an amount of energy equivalent 
to the detonation of 475 megaton bombs. 

3.5.9 Earthquake prediction 

The problem of earthquake prediction is extremely 
difficult and is associated with sundry other problems of 
a sociological nature. To predict an earthquake correctly 
means deciding, as far in advance as possible, exactly 
where and when it will occur. It is also necessary to judge 
how strong it will be, which means realistically that 
people want to know what the likely damage will be, a 
feature expressed in the earthquake intensity. In fact the 
geophysicist is almost helpless in this respect, because at 
best an estimate of the predicted magnitude can be made. 
As seen above, even if it is possible to predict accurately 
the magnitude, the intensity depends on many factors 
(e.g., local geology, construction standards, secondary 


effects like fires and floods) which are largely outside the 
influence of the seismologist who is asked to presage the 
seriousness of the event. The problem of prediction 
rapidly assumes sociological and political proportions. 
Even if the approximate time and place of a major earth¬ 
quake can be predicted with reasonable certainty, the 
question then remains of what to do about the situation. 
Properly, the threatened area should be evacuated, but 
this would entail economic consequences of possibly 
enormous dimension. 

The difficulties are illustrated by the following possible 
scenario. Suppose that seismologists conclude that a very 
large earthquake, with probable magnitude 7 or greater, 
will take place sometime in a known month of a given year 
under a specific urban center. No more precise details are 
possible; in particular, the time of occurrence cannot be 
determined more exactly. Publication of this kind of pre¬ 
diction would cause great consternation, even panic, 
which for some of the population could be as disastrous as 
the earthquake itself. Should the warning be withheld? If 
the entire area, with its millions of inhabitants, were to be 
evacuated, the economic dislocation would be enormous. 
How long should the evacuation last, when every day is 
economically ruinous? Clearly, earthquake prediction is 
only useful in a practical sense when it is accurate in both 
place and time. The responsible authorities must also be 
provided with a reasonable estimate of the expected mag¬ 
nitude, from which the maximum intensity 7 may be 
gauged with the aid of a relationship like that of Eq. 
(3.95). The problem then passes out of the domain of the 
scientist and into that of the politician, with the scientist 
retaining only a peripheral role as a consultant. But if the 
earthquake prediction is a failure, the consequences will 
rebound with certainty on the scientist. 

3.5.9.1 Prediction of the location of an earthquake 

It is easier to predict where a major earthquake is likely to 
occur than when it will occur. The global seismicity pat¬ 
terns demonstrate that some regions are relatively aseis- 
mic. They are not completely free of earthquakes, many 
of which appear to occur randomly and without warning 
in these regions. Some intraplate earthquakes have a 
history of repeated occurrence at a known location, 
which can sensibly be expected to be the locus of a future 
shock. However, because most earthquakes occur in the 
seismically active zones at plate margins, these are the 
prime areas for trying to predict serious events. Predicting 
the location of a future earthquake in these zones com¬ 
bines a knowledge of the historical seismicity pattern 
with the elastic rebound model of what causes an earth¬ 
quake. 

The seismic gap theory is based on the simple idea that 
global plates move under the influence of forces which 
affect the plates as entities. The interactions at a plate 
margin therefore act along the entire length of the inter¬ 
plate boundary. Models of plate tectonic reconstructions 
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Fig. 3.50 Seismic gaps along 
the Aleutian island arc. 
Shaded regions mark the 
areas of rupture of very large 
historic earthquakes ( M s or 
M w > 7.4). Three large gaps in 
seismicity are potential 
locations of a future large 
earthquake (after Sykes eta!., 
1981). 
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assume continuity of plate motions on a scale of millions of 
years. However, the global seismicity patterns show that on 
the scales of decades or centuries, the process is discontinu¬ 
ous both in time and place. This is because of the way indi¬ 
vidual earthquakes occur. According to the elastic rebound 
model, stress accumulates until it exceeds the local strength 
of the rocks, rupture produces motion on the fault, and an 
earthquake occurs. During the time of stress accumulation, 
the area experiences no major earthquake, and the regional 
pattern of seismicity shows a local gap (Fig. 3.50). This is 
the potential location for an earthquake that is in the 
process of accumulating the strain energy necessary to 
cause rupture. For example, the magnitude 7.6 earthquake 
at Izmit, Turkey, in 1999 resulted in more than 11,000 
deaths. It occurred in a 100-150 km seismic gap between 
events that occurred more than three decades earlier, in 
1967 and 1964, and which subsequently had been tranquil. 

3.5.92 Prediction of the time and size of an earthquake 

Seismic gap theory holds great promise as a means of 
determining where an earthquake is likely to occur. 
Unfortunately, it does not help to predict when it will 
occur or how large it will be. These factors depend on the 
local strength of the rocks and the rate at which strain 
accumulates. There are various ways to observe the effects 
of strain accumulation, but the largely unknown factor of 
local breaking strength of the rocks hinders prediction of 
the time of an earthquake. 

The strain accumulation results in precursory indica¬ 
tions of both sociological and scientific nature. The 
People’s Republic of China has suffered terribly from the 
ravages of earthquakes, partly because of the unavoidable 
use of low-quality construction materials. In 1966 the 
highly disciplined society in the People’s Republic of 
China was marshalled to report any strange occurrences 
associated with earthquakes. They noticed, for example, 
that wells and ponds bubbled, and sometimes gave off 
odors. Highly intriguing was the odd behavior of wild and 
domestic animals prior to many earthquakes. Dogs howled 
unaccountably, many creatures fell into panic, rats and 
mice left their holes, snakes abandoned their dens; even fish 
in ponds behaved in an agitated manner. It is not known 
how these creatures sense the imminent disaster, and the 
qualitative reports do not lend themselves to convenient 


statistical evaluation. However, prediction by scientific 
methods is still uncertain, so the usefulness of alternative 
premonitory phenomena cannot be rejected out of hand. 

Several scientific methods have been tested as possible 
ways of predicting the time of earthquake occurrence. 
They are based on detecting ground changes, or effects 
related to them, that accompany the progression of strain. 
For example, a geochemical method which has had some 
degree of success is the monitoring of radon. Some miner¬ 
als in the Earth’s crust contain discrete amounts of 
uranium. The gas radon is a natural product of the radioac¬ 
tive decay of uranium. It migrates through pores and 
cracks and because of its own radioactivity it is a known 
environmental health hazard in buildings constructed in 
some geographic areas. Radon gas can become trapped in 
the Earth’s crust, and in many areas it forms a natural 
radioactive background. Prior to some earthquakes anom¬ 
alous levels of radon have been detected. The enhanced 
leakage of radon from the crust may be due to porosity 
changes at depth in response to the accumulating strains. 

The build-up of strain is manifest in horizontal and 
vertical displacements of the Earth’s surface, depending 
on the type of faulting involved. These displacements can 
be measured geodetically by triangulation or by modern 
techniques of trilateration which employ laser-ranging 
devices to measure the time taken for a laser beam to 
travel to a reflecting target and back to its source. The 
travel-time of the beam is measured extremely accurately, 
and converted into the distance between emitter and 
reflector. A shift on the fault will change this distance. 
With laser techniques the constant creep of one side of a 
horizontal fault relative to the other can be observed 
accurately. In one method the source and receiver of a 
laser beam are placed on one side of a fault, with a reflec¬ 
tor on the opposite side. In an alternative method a laser 
source and receiver are placed on each side of the fault. 
Pulsed signals are beamed from each unit to an orbiting 
reflecting satellite, the position of which is known accu¬ 
rately. For each ground station, differences in the elapsed 
time of the pulse are converted by computer into ground 
movement, and into differential motion on the fault. 

Several methods are suited to detecting differential verti¬ 
cal motion. Sensitive gravimeters on opposite sides of a 
fault can detect vertical displacement of as little as one cen¬ 
timeter of one instrument relative to the other. Distension 
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of the Earth can be monitored with a tiltmeter, which is an 
instrument designed on the principle of a water-level. It 
consists of a long tube about 10 m in length, connecting 
two water-filled containers, in which the difference in water 
levels is monitored electronically. A tiltmeter is capable of 
determining tilt changes of the order of 10 -7 degrees. 
Tiltmeters and gravimeters installed near to active faults 
have shown that episodes of ground uplift and tilt can 
sometimes precede major earthquakes. 

Geodetic and geophysical observations, such as changes 
in the local geomagnetic field or the electrical resistivity of 
the ground, are of fundamental interest. However, the most 
promising methods of predicting the time and size of an 
imminent earthquake are based on seismic observations. 
According to the elastic rebound model, the next earth¬ 
quake on an active fault (or fault segment) will occur when 
the stress released in the most recent earthquake again 
builds up to the local breaking point of the rocks. Thus the 
probability at any time that an earthquake will occur on the 
fault depends on the magnitude of the latest earthquake, 
the time elapsed since it occurred and the local rate of accu¬ 
mulation of stress. A symptom of the stress build-up before 
a major earthquake can be an increase in foreshock activity, 
and this was evidently a key parameter in the successful 
prediction of a large earthquake in Liaonping province, 
China, in February, 1975. The frequency of minor shocks 
increased, at first gradually but eventually dramatically, 
and then there was an ominous pause in earthquake activ¬ 
ity. Chinese seismologists interpreted this “time gap” as an 
indication of an impending earthquake. The population 
was ordered to vacate their homes several hours before the 
province was struck by a magnitude 7.4 earthquake that 
destroyed cities and communities. Because of the successful 
prediction of this earthquake, the epicenter of which was 
near to the city of Haicheng, the death toll among the 
3,000,000 inhabitants of the province was very low. 

A technique, no longer in favor, but which at one time 
looked promising for predicting the time and magnitude 
of an earthquake is the dilatancy hypothesis, based upon 
systematic variations in the ratio of the travel-times of P- 
waves and S-waves which originated in the focal volume 
of larger shocks. Russian seismologists noticed that prior 
to an earthquake the travel-time ratio tjt changed sys- 
tematically: at first it decreased by up to 5%, then it 
returned to normal values just before the earthquake. 

The observations have been attributed to changes in the 
dilatancy of the ground. Laboratory experiments have 
shown that, before a rock fractures under stress, it develops 
minute cracks which cause the rocks to dilate or swell. This 
dilatancy alters the P-wave velocity, which drops initially 
(thereby increasing t ) as, instead of being water filled, the 
new volume of the dilated pores at first fills with air. Later, 
water seeps in under pressure, replaces the air and the tjt„ 
ratio returns to normal values. At this point an earthquake 
is imminent. The time for which the ratio remains low is a 
measure of the strain energy that is stored and therefore a 
guide to the magnitude of the earthquake that is eventually 


unleashed. Initial success in predicting small earthquakes 
with the dilatancy model led to a period of optimism that 
an ultimate solution to earthquake prediction had been 
found. Unfortunately, it has become apparent that the dila¬ 
tancy effect is not universal, and its importance appears to 
be restricted only to certain kinds of earthquakes. 

In summary, present seismicity patterns, in conjunction 
with our knowledge of where historic earthquakes have 
occurred, permit reasonable judgements of where future 
earthquakes are most likely to be located. However, despite 
years of effort and the investigation of various scientific 
methods, it is still not possible to predict reliably when an 
earthquake is likely to happen in an endangered area. 

3.5.10 Earthquake control 

Earthquakes constitute a serious natural environmental 
hazard. Despite great efforts by scientists in various 
countries, successful prediction is not yet generally possi¬ 
ble. Consequently, the protection of people against 
seismic hazard depends currently on the identification of 
especially perilous areas (such as active faults), the avoid¬ 
ance of these as the sites of constructions, the develop¬ 
ment and enforcement of appropriate building codes, and 
the education and training of the population in emer¬ 
gency procedures to be followed during and in the after¬ 
math of a shock. Unfortunately, many densely populated 
regions are subject to high seismic risk. It is impossible to 
prevent the cumulation of strain in a region subject to tec¬ 
tonic earthquakes; the efforts of the human race are not 
likely to have much effect on the processes of plate tecton¬ 
ics! However, it may be possible to influence the manner 
in which the strain energy is released. The catastrophic 
earthquakes are those in which a huge amount of strain 
energy that has accumulated over a long period of time is 
suddenly released in a single event. If the energy could be 
released progressively over a longer period of time in 
many smaller shakes, the violence and disastrous conse¬ 
quences of a major earthquake might be avoided. The 
intriguing possibility of this type of earthquake control 
has been investigated in special situations. 

In 1962 the US Army began to dispose of liquid toxic 
waste from the manufacture of chemical weapons by 
injection into a well, more than 3 km deep, near Denver, 
Colorado. Although the region had been devoid of earth¬ 
quake activity for the preceding 80 years, earthquakes 
began to occur several weeks after pumping started. Until 
1965, when waste injection was halted, more than 1000 
earthquakes were recorded. They were mostly very small, 
of the microearthquake category, but some had Richter 
magnitudes as high as 4.6. When pumping was halted, the 
seismicity ceased; when pumping was resumed, the earth¬ 
quake activity started anew. It was conjectured that the 
liquid waste had seeped into old faults, and by acting as a 
kind of lubricant or by increasing the pore pressure, had 
repeatedly permitted slippage, with an accompanying 
small earthquake. This incident suggested that it might be 
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Fig. 3.51 Comparison of P- 
wave radiation patterns and 
the relative amplitudes of 
long-period and short-period 
surface waves for an 
earthquake and a nuclear 
explosion (after Richards, 
1989). 
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possible to control fault motion, either by injecting fluids 
to lubricate sections of the fault plane, or by pumping out 
fluid to lock the fault. This opened the intriguing possibil¬ 
ity that, by making alternate use of the two processes, the 
slippage on a fault might be controlled so that it took 
place by a large number of small earthquakes rather than 
by a few disastrous earthquakes. 

In 1969 the US Geological Survey carried out a test of 
this lubrication effect in the depleted Rangely oil field in 
western Colorado. There were many disused wells in the 
oil field, through which fluids were pumped in and out of 
the ground over a considerable area. Meanwhile the local 
seismicity was monitored. This controlled test agreed 
with the observations at the Denver site, confirming that 
the earthquake activity correlated with the amount of 
fluid injected. Moreover, it was established that the earth¬ 
quake activity increased when the pore pressure exceeded 
a critical threshold value, and it ceased when the pressure 
dropped below this value as the fluid was withdrawn. 
Despite the apparent success of this experiment, it was 
agreed that further testing was necessary to explore the 
validity of the method. An obvious difficulty of testing 
the modification of seismic activity on critical faults is 
that the tests must be made in remote areas so as to avoid 
costly damage caused by the testing. The conditions 
under which the method may be applicable have not been 
established definitively. 

3.5.11 Monitoring nuclear explosions 

Since 1963 most nuclear explosions have been conducted 
underground to prevent the dangerous radioactive fallout 
that accompanies nuclear explosions conducted under¬ 
water or in the atmosphere. The detection and monitoring 
of such testing activity became important tasks for seis¬ 
mologists. The bilateral Threshold Test Ban Treaty of 
1974 between the former Soviet Union and the USA pro¬ 
hibited underground testing of nuclear devices with a 
yield greater than 150 kilotons of TNT equivalent, which 
corresponds roughly to an earthquake with magnitude 


about 6. Current efforts are underway to establish a 
global monitoring system to verify compliance with a 
future Comprehensive Test Ban Treaty. The system 
should detect nuclear explosions with a yield as low as 
one kiloton (a well-coupled kiloton explosion has a mag¬ 
nitude of around 4). Detection of these events at dis¬ 
tances of several thousand kilometers, and discriminating 
them from the approximately 7000 earthquakes that 
occur annually with magnitudes of 4 or above (Table 3.3) 
poses a monumental challenge to seismologists. 

In order to achieve the high detection capability needed 
to monitor underground testing, many so-called seismic 
arrays have been set up. An array consists of several indi¬ 
vidual seismometers, with spacing on the order of a kilo¬ 
meter or less, that feed their output signals in parallel into 
a centralized data-processing center. By filtering, delaying 
and summing the signals of the individual instruments, 
the incoherent noise is reduced and the coherent signal is 
increased, thus improving the signal-to-noise ratio signifi¬ 
cantly over that for a single sensor. A local seismic distur¬ 
bance arrives at the array on an almost horizontal path 
and triggers the individual seismometers at successively 
different times, whereas a teleseismic arrival from a very 
distant source reaches all seismometers in the array at 
nearly the same time along a steeply inclined path. The 
development of seismic arrays permitted the analysis of 
distant weak events. The enhanced sensitivity led to 
several advances in seismology. Features of the deep struc¬ 
ture of the Earth (e.g., the inner core) could be investi¬ 
gated, earthquake location became more accurate and the 
analysis of focal mechanisms received a necessary 
impetus. These improvements were essential because of 
the need to identify correctly the features that distinguish 
underground nuclear explosions from small earthquakes. 

An earthquake is the result of sudden motion of crustal 
blocks on opposite sides of a fault-plane. The radiation 
pattern of P-wave amplitude has four lobes of alternating 
compression and dilatation (Fig. 3.51). The first motions 
at the surface of the Earth are either pushes away from the 
source or tugs toward it, depending on the geometry of the 
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focal mechanism. In contrast, an underground explosion 
causes outward pressure around the source. The first 
motions at the surface are all pushes away from the source. 
Hence, focal mechanism analysis provides an important 
clue to the nature of the recorded event. Moreover, an 
explosion produces predominantly P-waves, while earth¬ 
quakes are much more efficient in also generating surface 
waves. Consequently, the relative amplitudes of the long- 
period surface-wave part of the record and of the short- 
period P-wave part are much higher for an earthquake 
than for an explosion (Fig. 3.51). 

Further discrimination criteria are the epicentral loca¬ 
tion and the focal depth. Intraplate earthquakes are much 
less common than earthquakes at active plate margins, so 
an intraplate event might be suspected to be an explosion. 
If the depth of a suspicious event is determined with a high 
degree of confidence to be greater than about 15 km, one 
can virtually exclude that it is an explosion. Deeper holes 
have not been drilled due to the great technical difficulty, 
e.g., in dealing with the high temperatures at such depths. 

3.6 SEISMIC WAVE PROPAGATION 

3.6.1 Introduction 

A seismic disturbance is transmitted by periodic elastic dis¬ 
placements of the particles of a material. The progress of 
the seismic wave through a medium is determined by the 
advancement of the wavefront. We now have to consider 
how the wave behaves at the boundary between two media. 
Historically, two separate ways of handling this problem 
developed independently in the seventeenth century. One 
method, using Huygens' principle, describes the behavior of 
wavefronts; the other, using Fermat’s principle, handles the 
geometry of ray paths at the interface. The eikonal equation 
(Section 3.3.2.5) establishes that these two methods of 
treating seismic wave propagation are equivalent. 

In the Earth's crust the velocities of P- and S-waves are 
often proportional to each other. This follows from Eqs. 
(3.39) and (3.47), which give the body-wave velocities in 
terms of the Lame constants A and p. For many rocks, 
Poisson’s relation A = p. applies (see Box 3.1), and so 

5 am 

For brevity, the following discussion handles P-waves 
only, which are assumed to travel with velocities a l and a 2 
in the two media. However, we can equally apply the 
analyses to S-waves, by substituting the appropriate 
shear-wave velocities /3 1 and /3, for the media. 

3.6.2 Huygens'principle 

The passage of a wave through a medium and across inter¬ 
faces between adjacent media was first explained by the 
seventeenth century Dutch mathematician and physicist, 



Fig. 3.52 Application of Huygens' principle to explain the advance of a 
plane wavefront. The wavefront at CD is the envelope of wavelets set 
up by particle vibrations when the wavefront was at the previous 
position AB. Similarly, the envelope of wavelets set up by vibrating 
particles in the wavefront CD forms the wavefront EF. 

Christiaan Huygens, who formulated a principle for the 
propagation of light as a wave, rather than as the stream of 
particles visualized by his great and influential contempo¬ 
rary, Sir Isaac Newton. Although derived for the laws of 
optics, Huygens’ principle (1678) can be applied equally to 
any kind of wave phenomenon. The theory is based on 
simple geometrical constructions and permits the future 
position of a wavefront to be calculated if its present posi¬ 
tion is known. Huygens’ principle can be stated: "Allpoints 
on a wavefront can be regarded as point sources for the pro¬ 
duction of new spherical waves; the new wavefront is the 
tangential surface (or envelope) of the secondary wavelets. ” 
This principle can be illustrated simply for a plane 
wavefront (Fig. 3.52), although the method also applies to 
curved wavefronts. Let the wavefront initially occupy the 
position AB and let the open circles represent individual 
particles of the material in the wavefront. The particles are 
agitated by the arrival of the wavefront and act as sources 
of secondary wavelets. If the seismic velocity of the mater¬ 
ial is V, the distance travelled by each wavelet after time t is 
Vt and it describes a small sphere around its source parti¬ 
cle. If the original wavefront contained numerous closely 
spaced particles instead of a discrete number, the plane CD 
tangential to the small wavelets would represent the new 
position of the wavefront. It is also planar, and lies at a per¬ 
pendicular distance Vt from the original wavefront. In 
their turn the particles in the wavefront CD act as sources 
for new secondary wavelets, and the process is repeated. 
This principle can be used to derive the laws of reflection 
and refraction of seismic waves at an interface, and also to 
describe the process of diffraction by which a wave is 
deflected at a corner or at the edge of an object in its path. 

3.6.2.1 The law of reflection using Huygens' principle 

Consider what happens to a plane P-wave travelling in a 
medium with seismic velocity aq when it encounters the 
boundary to another medium in which the P-wave veloc¬ 
ity is a 2 (Fig. 3.53). At the boundary part of the energy of 
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Fig. 3.53 The reflection of a plane P-wave at an interface between two 
media with different seismic velocities: incident plane waves (e.g., AC); 
spherical wavelets set up in the upper medium by vibrating particles in 
the segment AB of the interface; and the reflected plane wave BD, 
which is the envelope of the wavelets. 

the incident wave is transferred to the second medium, 
and the remainder is reflected back into the first medium. 
If the incident wavefront AC first makes contact with the 
interface at A it agitates particles of the first medium at A 
and simultaneously the particles of the second medium in 
contact with the first medium at A. The vibrations of 
these particles set up secondary waves that travel away 
from A, back into the first medium as a reflected wave 
with velocity aq (and onward into the second medium as a 
refracted wave with velocity a ? ). 

By the time the incident wavefront reaches the interface 
at B all particles of the wavefront between A and B have 
been agitated. Applying Huygens’ principle, the wavefront 
of the reflected disturbance is the tangent plane to the sec¬ 
ondary wavelet in the first medium. In Fig. 3.53 this is rep¬ 
resented by the tangent BD from B to the circle centered at 
A, the first point of contact with the boundary. In the time 
t that elapses between the arrival of the plane wave at A 
and its arrival at B, the incident wavefront travels a dis¬ 
tance CB and the secondary wavelet from A travels the 
equal distance AD. The triangles ABC and ABD are con¬ 
gruent. It follows that the reflected wavefront makes the 
same angle with the interface as the incident wave. 

It is customary to describe the orientation of a plane 
by the direction of its normal. The angle between the 
normal to the interface and the normal to the incident 
wavefront is called the angle of incidence (/'); the angle 
between the normal to the interface and the normal to the 
reflected wavefront is called the angle of reflection (/'). 
This application of Huygens’ principle to plane seismic 
waves shows that the angle of reflection is equal to the 
angle of incidence (? = /'). This is known as the law of 
reflection. Although initially developed for light-beams, it 
is also valid for the propagation of seismic waves. 

3.62.2 The law of refraction using Huygens' principle 

The discussion of the interaction of the incident wave 
with the boundary can be extended to cover the part of 
the disturbance that travels into the second medium (Fig. 
3.54). This disturbance travels with the velocity a 2 of the 
second medium. Let t be the time taken for the incident 
wavefront in the first medium to advance from C to B; 
then BC = ajt. In this time all particles of the second 



Fig. 3.54 The refraction of a plane P-wave at an interface between two 
media with different seismic velocities n 1 and a 2 (>“i ): incident plane 
waves (e.g., AC); the spherical wavelets set up in the lower medium by 
vibrating particles in the segment AB of the interface; and the refracted 
plane wave BE, which is the envelope of the wavelets. The angles of 
incidence (/) and refraction (r) are defined between the normal to the 
interface and the respective rays. 


medium between A and B have been agitated and now act 
as sources for new wavelets in the second medium. When 
the incident wave reaches B, the wavelet from A in the 
second medium has spread out to the point E, where 
AE = a 1 t. The wavefront in the second medium is the 
tangent BE from B to the circle centered at A. The angle 
of incidence (/) is defined as before; the angle between the 
normal to the interface and the normal to the transmitted 
wavefront is called the angle of refraction (r). 
Comparison of the triangles ABC and ABE shows that 
BC = AB sin i, and AE = AB sin r. Consequently, 


AB sin; _ BC_ a \ t 
AB sinr AE a 2 t 


(3.99) 


sin i _ a \ 
sinr a 2 


(3.100) 


Equation (3.100) is called the law of refraction for 
plane seismic waves. Its equivalent in optics is often called 
Snell’s law, in recognition of its discoverer, the Dutch 
mathematician Willebrod Snellius (or Snell). 


3.62.3 Diffraction 

The laws of reflection and refraction derived above with 
the aid of Huygens' principle apply to the behavior of 
plane seismic waves at plane boundaries. When a plane or 
spherical seismic wave encounters a pointed obstacle or 
discontinuous surface, it experiences diffraction. This phe¬ 
nomenon allows the wave to bend around the obstacle, 
penetrating what otherwise would be a shadow zone for 
the wave. It is the diffraction of sound waves, for example, 
that allows us to hear the voices of people who are still 
invisible to us around a corner, or on the other side of a 
high fence. Huygens’ principle also gives an explanation 
for diffraction, as illustrated by the following simple case. 

Consider the normal incidence of a plane wave on 
a straight boundary that ends at a sharp comer B 
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Fig. 3.55 Explanation of diffraction at an edge with the aid of Huygens' 
principle, (a) The incident and reflected plane wavefronts are the 
envelopes to Huygens wavelets, which are able to carry the incident 
disturbance around a sharp corner, (b) The incident rays are absorbed, 
reflected or pass by the obstacle, but some rays related to the wavelets 
generated at the point of the obstruction are diffracted into its shadow. 

(Fig. 3.55). The incident wave is reflected along the entire 
length AB, with each particle of AB acting as a secondary 
source according to Fluygen’s principle. Beyond the edge 
B the incident wavefronts cannot be reflected. The plane 
wavefront passes by the edge B, so that the point C should 
lie in the shadow of AB. However, the corner B also acts 
as a source of secondary wavelets, part of which con¬ 
tribute to the reflected wavefront and part pass into the 
shadow zone. The intensity of the wave diffracted into the 
shadow zone is weaker than in the main wavefront, and it 
decreases progressively with increasing angle away from 
the direction of travel of the incident wavefront. 



Fig. 3.56 Geometry of incident and reflected rays for derivation of the 
law of reflection with the aid of Fermat's principle. 


the points. If dx is the element of distance along a ray path 
and c is the seismic velocity over this short distance, then 
the travel-time t between A and B is minimum. Thus, 


t = 


B 



= minimum 


(3.101) 


Generally, when the velocity varies continuously with 
position, the determination of the ray path is intricate. In 
the case of a layered medium, in which the velocity is con¬ 
stant in each layer, Fermat’s principle provides us with an 
independent method for determining the laws of reflec¬ 
tion and refraction. 


3.6.3.1 The law of reflection using Fermat's principle 

Consider the reflection of a seismic ray in a medium with 
constant P-wave velocity ot l at the boundary to another 
medium (Fig. 3.56). For convenience we take the bound¬ 
ary to be horizontal. Let A be a point on the incident ray 
at a vertical distance h from the boundary and let B be the 
corresponding point on the reflected ray. Let C and D be 
the nearest points on the boundary to A and B, respec¬ 
tively. Further, let dbe the horizontal separation AB, and 
let O be the point of reflection on the interface at a hori¬ 
zontal distance x from C; then OD is equal to (d - x) and 
we can write for the travel-time t from A to B: 

t = = grjj y 111 2 + x 2 + V h 2 + {d — x) 2 ] (3.102) 

According to Fermat’s principle the travel-time t must 
be a minimum. The only variable in Eq. (3.102) is x. To find 
the condition that gives the minimum travel-time we 
differentiate t with respect to x and set the result equal to 
zero: 


3.6.3 Fermat's principle 

The behavior of seismic ray paths at an interface is 
explained by another principle of optics that was 
formulated - also in the seventeenth century - by the 
French mathematician Pierre de Fermat. As applied to 
seismology, Fermat’s principle states that, of the many 
possible paths between two points A and B, the seismic ray 
follows the path that gives the shortest travel-time between 


dt = 1 f x _ fd-x) 

d- x a i _v//z 2 + x 2 v/ h 1 + (d — x ) 2 


(3.103) 


By inspection of Fig. 3.56 the relationships of these 
expressions to the angle of incidence (z) and the angle of 
reflection (/') are evident. The first expression inside the 
brackets is sin i and the second is sin The condition for 
the minimum travel-time is again /' = i; the angle of reflec¬ 
tion equals the angle of incidence. 
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Fig. 3.57 Geometry of incident and refracted rays for derivation of the 
law of refraction with the aid of Fermat's principle. 


3.6.3.2 The law of refraction using Fermat's principle 

We can use a similar approach to determine the law of 
refraction. This time we study the passage of the seismic 
ray from a medium with velocity a 1 into a medium with 
higher velocity a 2 (Fig. 3.57). Let A again be a point on 
the incident ray at a vertical distance h from a point C on 
the interface. The ray traverses the boundary at O, a hori¬ 
zontal distance x from C. Let B now be a point on the ray 
in the second medium at a distance h from D, the closest 
point on the interface. The distance CD is d, so that again 
OD is equal to (d - x). The travel-time t which we have to 
minimize is given by 


1 a. 


(3.104) 


Differentiating Eq. (3.104) with respect to x and 
setting the result equal to zero gives us the condition for 
the minimum value of t: 


x _ (d-x) 

aqy//; 2 + x 2 a 7 ^h 2 + (d — x) 2 


(3.105) 


By reference to Fig. 3.57 we can write this expression 
in terms of the sines of the angles of incidence ( i ) and 
refraction ( r ). This application of Fermat’s principle to 
the seismic ray paths gives again the law of refraction that 
we derived by applying Fluygens’ principle to the wave- 
fronts (Eq. (3.100)). It can also be stated as 


sin/. 
a , 


sm /• 
a- 


(3.106) 


In this example we have assumed that a 2 >a l . As it 
passes from the medium with lower velocity into the 
medium with higher velocity the refracted ray is bent 
away from the normal to the boundary, giving an angle 
of refraction that is greater than the angle of incidence 
(r >/). Under the opposite conditions, if a 2 <a v the 
refracted ray is bent back toward the normal and the 


angle of refraction is less than the angle of incidence 
0 < 0 - 


3.6.4 Partitioning of seismic body waves at a boundary 

The conditions that must be fulfilled at a boundary are 
that the normal and tangential components of stress, as 
well as the normal and tangential components of the 
displacements, must be continuous across the interface. 
If the normal (or tangential) stress were not continuous, 
the point of discontinuity would experience infinite 
acceleration. Similarly, if the normal displacements 
were not continuous, a gap would develop between the 
media or parts of both media would overlap to occupy 
the same space; discontinuous tangential displacements 
would result in relative motion between the media 
across the boundary. These anomalies are impossible if 
the boundary is a fixed surface that clearly separates the 
media. 

As a result of the conditions of continuity, a P-wave 
incident on a boundary energizes the particles on each 
side of the boundary at the point of incidence, and sets up 
four waves. The energy of the incident P-wave is parti¬ 
tioned between P- and S-waves that are reflected from the 
boundary, and other P- and S-waves that are transmitted 
into the adjacent layer. The way in which this takes place 
may be understood by considering the particle motion 
that is induced at the interface. 

The particle motion in the incident P-wave is parallel 
to the direction of propagation. At the interface the 
vibration of particles of the lower layer can be resolved 
into a component perpendicular to the interface and a 
component parallel to it in the vertical plane containing 
the incident P-wave. In the second layer each of these 
motions can in turn be resolved into a component parallel 
to the direction of propagation (a refracted P-wave) and a 
component perpendicular to it in the vertical plane (a 
refracted SV-wave). Because of continuity at the inter¬ 
face, similar vibrations are induced in the upper layer, 
corresponding to a reflected P-wave and a reflected SV- 
wave, respectively. 

Let the angles between the normal to the interface and 
the ray paths of the P- and S-waves in medium 1 be i and 
i s , respectively, and the corresponding angles in medium 2 
be r and r (Fig. 3.58). Applying Snell’s law to both the 
reflected and refracted P- and S-waves gives 


sin/ p sin/ s sin/ p sinr s 


(3.107) 


By similar reasoning it is evident that an incident SV- 
wave also generates vibrations that have components 
normal and parallel to the interface, and will set up 
refracted and reflected P- and SV-waves. The situation is 
different for an incident SH-wave, which has no compo¬ 
nent of motion normal to the interface. In this case only 
refracted and reflected SH-waves are created. 
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Fig. 3.58 The generation of reflected and refracted P- and S-waves 
from a P-wave incident on a plane interface. 

3.6.4.1 Subcritical and supercritical reflections, and critical 
refraction 

Let O be a seismic source near the surface of a uniformly 
thick horizontal layer with P-wave velocity a l that lies on 
top of a layer with higher velocity a 1 (Fig. 3.59). Consider 
what happens to seismic rays that leave O and arrive at the 
boundary with all possible angles of incidence. The most 
simple ray is that which travels vertically to meet the 
boundary with zero angle of incidence at the point N. 
This normally incident ray is partially reflected back along 
its track, and partially transmitted vertically into the next 
medium without change of direction. As the angle of 
incidence increases, the point of incidence moves from 
N towards C. The transmitted ray experiences a change 
of direction according to SnelFs law of refraction, and 
the ray reflected to the surface is termed a subcritical 
reflection. 

The ray that is incident on the boundary at C is 
called the critical ray because it experiences critical 
refraction. It encounters the boundary with a critical 
angle of incidence. The corresponding refracted ray 
makes an angle of refraction of 90° with the normal to 
the boundary. As a result, it travels parallel to the 
boundary in the top of the lower layer with faster veloc¬ 
ity a 7 . The sine of the angle of refraction of the critical 
ray is unity, and we can calculate the critical angle, i c , by 
applying Snell’s law: 

sin ' c = «2 (3-108) 

The critical ray is accompanied by a critical reflection. It 
reaches the surface at a critical distance (x c ) from the 
source at O. The reflections that arrive inside the critical 


critical 

distance 



Fig. 3.59 The critical reflection defines two domains, corresponding to 
regions of subcritical and supercritical reflection, respectively. 

distance are called subcritical reflections. At angles up to 
the critical angle, refracted rays pass into the lower 
medium, but for rays incident at angles greater than the 
critical angle, refraction is no longer possible. The seismic 
rays that are incident more obliquely than the critical angle 
are reflected almost completely. These reflections are 
termed supercritical reflections, or simply wide-angle reflec¬ 
tions. They lose little energy to refraction, and are thus 
capable of travelling large distances from the source in the 
upper medium. Supercritical reflections are recorded with 
strong amplitudes on seismograms at distant stations. 

3.6.5 Reflection seismology 

Reflection seismology is directed primarily at finding the 
depths to reflecting surfaces and the seismic velocities of 
subsurface rock layers. The techniques of acquiring and 
processing reflection seismology data have been developed 
and refined to a very high degree of sophistication as a 
result of the intensive application of this method in the 
search for petroleum. The principle is simple. A seismic 
signal (e.g., an explosion) is produced at a known place at 
a known time, and the echoes reflected from the bound¬ 
aries between rock layers with different seismic velocities 
and densities are recorded and analyzed. Compactly 
designed, robust, electromagnetic seismometers - called 
“geophones” in industrial usage - are spread in the region 
of subcritical reflection, within the critical distance from 
the shot-point, where no refracted arrivals are possible. 
Within this distance the only signals received are the wave 
that travels directly from the shot-point to the geophones 
and the waves reflected at subsurface interfaces. Surface 
waves are also recorded and constitute an important dis¬ 
turbing “noise,” because they interfere with the reflected 
signal. The closer the geophone array is located to the 
shot-point, the more nearly the paths of the reflected rays 
travel vertically. Reflection seismic data are most usually 
acquired along profiles that cross geological structures as 
nearly as possible normal to the strike of the structure. 
The travel-times recorded at the geophones along a profile 
are plotted as a two-dimensional cross-section of the 
structure. In recent years, three-dimensional surveying, 
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Fig. 3.60 The split-spread method of obtaining continuous subsurface 
coverage of a seismic reflector. 


which covers the entire subsurface, has become more 
important. 

Several field procedures are in common use. They are 
distinguished by different layouts of the geophones rela¬ 
tive to the shot-point. The most routine application of 
reflection seismology is in continuous profiling, in which 
the geophones are laid out at discrete distances along a 
profile through the shot-point. To reduce seismic noise, 
each recording point is represented by a group of inter¬ 
connected geophones. After each shot the geophone 
layout and shot-point are moved a predetermined dis¬ 
tance along the profile, and the procedure is repeated. 
Broadly speaking, there are two main variations of this 
method, depending on whether each reflection point on 
the reflector is sampled only once ( conventional coverage) 
or more than once ( redundant coverage). 

The most common form of conventional coverage is a 
split-spread method (Fig. 3.60), in which the geophones are 
spread symmetrically on either side of the shot-point. If the 
reflector is flat-lying, the point of reflection of a ray 
recorded at any geophone is below the point midway 
between the shot-point and the geophone. For a shot-point 
at Q the rays QAP and QBR that are reflected to geophones 
at P and R represent extreme cases. The two-way travel¬ 
time of the ray QAP gives the depth of the reflection point 
A, which is plotted below the mid-point of QP. Similarly, B 
is plotted below the mid-point of QR. The split-spread 
layout around the shot-point Q gives the depths of reflec¬ 
tion points along AB, which is half the length of the geo¬ 
phone spread PR. The shot-point is now moved to the point 
R, and the geophones between P and Q are moved to cover 
the segment RS. From the new shot-point R the positions 
of reflection points in the segment BC of the reflector are 
obtained. The ray RBQ from shot-point R to the geophone 
at Q has the same path as the ray QBR from shot-point Q to 
the geophone at R. By successively moving the shot-point 
and half of the split-spread geophone layout a continuous 
coverage of the subsurface reflector is obtained. 

Redundant coverage is illustrated by the common-mid¬ 
point method, which is routinely employed as a means of 
reducing noise and enhancing the signal-to-noise ratio. 
Commonly 24 to 96 groups of geophones feed recorded 
signals into a multi-channel recorder. The principle of 
common-mid-point coverage is illustrated for a small 
number of 11 geophone groups in Fig. 3.61. When a shot 
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Fig. 3.61 Common-mid-point method of seismic reflection shooting, 
showing rays from successive shot-points at A, B and C and the 
repeated sampling of the same point on the reflector (e.g., d) by rays 
from each shot-point. 

is fired at A, the signals received at geophones 3-11 give 
subsurface coverage of the reflector between points a and 
e. The shot-point is now moved to B, which coincides with 
the position occupied by geophone 2 for the first shot, 
and the geophone array is moved forward correspond¬ 
ingly along the direction of the profile to positions 4-12. 
From shot-point B the subsurface coverage of the reflec¬ 
tor is between points b and f. The reflector points b to e 
are common to both sets of data. By repeatedly moving 
the shot-point and geophone array in the described 
manner, each reflecting point of the interface is sampled 
multiply. For example, in Fig. 3.61 the reflecting point d is 
sampled multiply by the rays Ad 9, Bd8, Cd7, etc. The 
lengths of these ray paths are different. During subse¬ 
quent data-processing the reflection travel-times are cor¬ 
rected for normal moveout, which is a geometrical effect 
related to geophone distance from the shot-point. The 
records are then stacked, which is a procedure for enhanc¬ 
ing the signal-to-noise ratio. 

3.6.5.1 Reflection at a horizontal interface 

The simplest case of seismic reflection is the two-dimen¬ 
sional reflection at a horizontal boundary (Fig. 3.62). Let 
the reflecting bed be at depth d below the shot-point S. 
The ray that strikes the boundary at R is reflected to the 
surface and recorded by a geophone at the point G, so 
that the angles of incidence and reflection are equal. Let 
G be at a horizontal distance .v from the shot-point. If the 
P-wave velocity is V, the first signal received at G is from 
the direct wave that travels directly along SG. Its travel¬ 
time is given by t d =x/V. It is important to keep in mind 
that the direct wave is not a surface wave but a body wave 
that travels parallel to and just below the surface of the 
top layer. The travel-time t of the reflected ray SRG 
is (SR + RG)/ V. However, SR and RG are equal and 
therefore 
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laid out close to the shot-point and the assumption is made 
that the geophone distance is much less than the depth of 
the reflector (x < d). Equation (3.110) becomes 


fv-fnU + lgj 2 
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(3.112) 


The difference between the travel-time t and the shot- 
point travel-time t Q is the normalmoveout, A/ n = t x -t 0 . By 
rearranging Eq. (3.112) we get 


A t = 


2V 2 t n 


(3.113) 


The echo time t 0 and the normal moveout time At n 
are found from the reflection data. The distance x of the 
geophone from the shot-point is known and therefore 
the layer velocity V can be determined. The depth d of the 
reflecting horizon can then be found by using the formula 
for the echo time. 

An alternative way of interpreting reflection arrival 
times becomes evident when Eq. (3.111) is rearranged in 
the form 


Fig. 3.62 The travel-time versus distance curve for reflections from a 
horizontal boundary is a hyperbola. The vertical reflection time t 0 is the 
intercept of the hyperbola with the travel-time axis. 


t = 



4 d 2 



(3.110) 


At ,x: = 0 the travel-time corresponds to the vertical 
echo from the reflector; this “echo-time” is given by t 0 = 
2 d/V. The quantity under the square root in Eq. (3.110) 
determines the curvature of the t-x curve and is called the 
normal moveout factor. It arises because the ray reaching a 
geophone at a horizontal distance x from the shot-point 
has not travelled vertically between it and the reflector. 
Squaring both sides of Eq. (3.110) and rearranging terms 
gives 



(3.111) 


This is the equation of a hyperbola (Fig. 3.62) that is 
symmetrical about the vertical time axis, which it intersects 
at t Q . For large distances from the shot-point (x > 2d) the 
travel-time of the reflected ray approaches the travel-time 
of the direct ray and the hyperbola is asymptotic to the two 
lines t= ± x/V. 

A principle goal of seismic reflection profiling is usually 
to find the vertical distance (d) to a reflecting interface. 
This can be determined from t 0 , the two-way reflection 
travel-time recorded by a geophone at the shot-point, once 
the velocity Vis known. One way of determining the veloc¬ 
ity is by comparing ? 0 with the travel-time t x to a geophone 
at distance x. In reflection seismology the geophones are 


t 2 = 



(3.114) 


A plot of t 2 against x 2 is a straight line that has slope 
1 IV 2 . Its intercept with the t 2 -axis gives the square of the 
echo time, t 0 , from which the depth d to the reflector can 
be found once the velocity V is known. The record at each 
geophone will contain reflections from several reflectors. 
For the first reflector the velocity determined by the t 2 -x 2 
method is the true interval velocity of the uppermost layer, 
Fj, which, in conjunction with f 01 , the first echo time, gives 
the thickness <7, of the top layer. Flowever, the ray reflected 
from the second interface has travelled through the first 
layer with interval velocity F, and the second layer with 
interval velocity V 2 . The velocity interpreted in the t 2 -x 2 
method for this reflection, and for reflections from all 
deeper interfaces, is an average velocity. If the incident and 
reflected rays travel nearly vertically, the average velocity 
V a for the reflection from the nth reflector is given by 


^3 "f ' 


+ t„ 


2X 

i= 1 
n 


(3.115) 


where d- is the thickness and t, the interval travel-time for 
the /th layer. 

The t 2 -x 2 method is a simple way of estimating layer 
thicknesses and average velocities for a multi-layered 
Earth (Fig. 3.63). The slope of the second straight line 
gives F a 2 , which is used with the appropriate echo time t 02 
to find the combined depth Z> ? to the second interface, 
given by D 2 = d x + d 2 = (F a 2 )(t 02 ); d x is known, and so d 2 
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Fig. 3.63 Illustration of a "f 2 -x 2 plot" for near-vertical reflections from 
three horizontal reflectors; V , is the true velocity V\ of layer 1, but V 2 
and V a 3 are "average" velocities that depend on the true velocities and 
the layer thicknesses. 


can be calculated. The two-way travel-time in the second 
layer is (r 02 - f 01 ) and thus the interval velocity V 2 can be 
found. In this way the thicknesses and interval velocities 
of deeper layers can be determined successively. 

In fact, of course, the rays do not travel vertically but 
are bent as they pass from one layer to another (Fig. 
3.63). Moreover, the elastic properties of a layer are rarely 
homogeneous so that the seismic velocity is variable and 
the ray path in the layer is curved. Exploration seismolo¬ 
gists have found it possible to compensate for these effects 
by replacing the average velocity with the root-mean- 
square velocity V rms defined by 

VLs = - V— (3.116) 

2 >« 

i— 1 

where V j is the interval velocity and t t the travel-time for 
the z'th layer. 


3.6.52 Reflection at an inclined interface 

When the reflecting interface is inclined at an angle 9 to 
the horizontal, as in Fig. 3.64, the shortest distance d 
between the shot-points and the reflector is the perpen¬ 
dicular distance to the inclined plane. The paths of 
reflected rays on the down-dip side of the shot-point are 
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Fig. 3.64 The travel-time versus distance curve for an inclined 
reflector is also a hyperbola with vertical axis (cf. Fig. 3.62), but the 
minimum travel-time (f ) is measured at distance x m from the 
shot-point. 


longer than those on the up-dip side; this has correspond¬ 
ing effects on the travel-times. The rays obey the laws of 
reflection optics and appear to return to the surface from 
the point S', which is the image point of the shot-point 
with respect to the reflector. The travel-time t through the 
layer with velocity V is readily found with the aid of the 
image point. For example for the ray SRG, recorded by a 
geophone on the surface at G, we get 

,_ SR + RG _ S'R + RG _ S'G 11Tl 

V - V - V 


The image point S' is as far behind the reflector as the 
shot-point is in front: S'S = 2rf. In triangle S'SG the side 
SG equals the geophone distance ,v and the obtuse angle 
S'SG equals (90°+ 0). If we apply the law of cosines to 
the triangle S'SG we can solve for S'G and substitute the 
answer in Eq. (3.117). This gives the travel-time of the 
reflection from the inclined boundary: 



(3.118) 
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Fig. 3.65 When the reflector is inclined and depths are plotted vertically 
under geophone positions, the true reflecting points A, B and C are 
mapped at A', B' and C', falsifying the position of the reflector. 


Equation (3.118) is the equation of a hyperbola whose 
axis of symmetry is vertical, parallel to the f-axis. For a 
flat reflector the hyperbola was symmetric about the t-axis 
(Fig. 3.62) and the minimum travel-time (echo time) cor¬ 
responded to the vertical reflection below the shot-point 
(x = 0). For an inclined reflector the minimum travel-time 
t m is no longer the perpendicular path to the reflector, 
which would give the travel-time t 0 in Fig. 3.64. Although 
the perpendicular path is the shortest distance from shot- 
point to reflector , it is not the shortest path of a reflected 
ray between the shot-point and a geophone. The shortest 
travel-time is recorded by the geophone at a horizontal 
distance x m on the up-dip side of the shot-point. The 
coordinates {x m , t m ) of the minimum point of the travel- 
time hyperbola are 


x m = — 2d sin 9 
_ 2dcos6 

'm y 


(3.119) 


In practice it is not known a priori whether a reflector is 
horizontal or inclined. If reflection records are not cor¬ 
rected for the effect of layer-dip, an error results in plotting 
the positions of dipping beds. The shot-point travel-time t 0 
gives the direct distance to a reflector, but the path along 
which the echo has travelled is not known. Consider the 
geometry of the inclined boundary in Fig. 3.65. First 
arrival reflections recorded for shot-points P, Q, and R 
come from the true reflection points A, B, and C. If the 
computed reflector depths are plotted directly below the 
shot-points at A', B' and C', the dipping boundary will 
appear to lie at a shallower depth than its true position, and 
the apparent dip of the reflector will be less steep than the 
true dip. This leads to a distorted picture of the under¬ 
ground structure. For example, an anticline appears 
broader and less steep-sided than it is. Similarly, if the limbs 
of a syncline dip steeply enough, the first arrivals from the 
dipping limbs can conceal the true structure (Fig. 3.66). 

This happens when the radius of curvature of the 
bottom of the syncline is less than the subsurface depth of 


surface A B C D,E F G 



Fig. 3.66 Paths of reflected rays over an anticline and syncline, showing 
the false apparent depth to the reflecting surface. True reflection points 
A-G are wrongly mapped at locations a-g beneath the corresponding 
shot-points. 



Fig. 3.67 (a) Paths of rays reflected from both flanks and the trough of 
a tightly curved syncline, (b) Appearance of the corresponding reflection 
record; the letters on the cusped feature refer to the reflection points 
in (a). 


its axis. Over the axis of the syncline, rays reflected from 
the dipping flanks may be the first to reach the shot-point 
geophone. The bottom of the syncline is seen as an 
upwardly convex reflection between two cusps (Fig. 3.67). 
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Fig. 3.68 Partitioning of the 
energy of an incident P-wave 
between refracted and 
reflected P- and S-waves 
(after Dobrin, 1976 and 
Richards, 1961). 


P-waves 
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On an uncorrected reflection record, the appearance of a 
tight syncline resembles a diffraction. 

Reflection seismic records must be corrected for non¬ 
vertical reflections. The correctional process is called 
migration. It is an essential part of a reflection seismic 
study. When the reflection events on seismic cross- 
sections are plotted vertically below control points on the 
surface (e.g., as the two-way vertical travel-time to a 
reflector below the shot-point), the section is said to be 
unmigrated. As discussed above, an unmigrated section 
misrepresents the depth and dip of inclined reflectors. A 
migrated section is one which has been corrected for non¬ 
vertical reflections. It gives a truer picture of the positions 
of subsurface reflectors. 

The process of migration is complex, and requires 
prior knowledge of the seismic velocity distribution, 
which in an unexplored or tectonically complicated 
region is often inadequately known. Several techniques, 
mostly computer-based, can be used but to treat them 
adequately is beyond the scope of this text. 

3.6.53 Reflection and transmission coefficients 

The partitioning of energy between refractions and reflec¬ 
tions at different angles of incidence on a boundary is 
rather complex. For example, an incident P-wave may be 
partially reflected and partially refracted, or it may be 
totally reflected, depending how steeply the incident ray 
encounters the boundary. The fraction of the incident P- 
wave energy that is partitioned between reflected and 
refracted P- and S-waves depends strongly on the angle of 
incidence (Fig. 3.68). In the case of oblique incidence at 
less than the critical angle, the amplitudes of the different 
waves are given by complicated functions of the wave 
velocities and the angles of incidence, reflection and 


refraction. The relative amounts of energy in the 
refracted and reflected P- and S-waves do not change 
much for angles of incidence up to about 15°. Beyond 
the critical angle, the refracted P-wave ceases, so that the 
incident energy is partly reflected as a P-wave and partly 
converted to refracted and reflected S-waves. 

In practice, reflection seismology is carried out at 
comparatively small angles of incidence. At normal inci¬ 
dence on an interface a P-wave excites no tangential 
stresses or displacements, and no shear waves are 
induced. The partitioning of energy between the 
reflected and refracted P-waves then becomes much 
simpler. It depends on a property of each medium known 
as its acoustic impedance, Z, which is defined as the 
product of the density p of the medium and its P-wave 
velocity a; thus Z = pa. The solution of the equations for 
the amplitudes A 1 and A 2 of the reflected and refracted 
P-waves, respectively, in terms of the amplitude A 0 of the 
incident wave are given by: 

pr A i Z 2~ Z 1 P 2 a 2 -Pi a i 
A 0 Z 2 + Z 1 Pl a 2 + Pl«l 
A 1 2 Z, 2p,a, 

TC = T 0 = Z^Z I = p 2 a 2 + p 1 a 1 (3 ' 120) 

The amplitude ratios RC and TC are called the reflec¬ 
tion coefficient and the transmission coefficient, respec¬ 
tively. As shown earlier (see Eq. (3.70)), the energy of a 
wave is proportional to the square of its amplitude. The 
fraction E r of the incident energy that is reflected is given 
by the square of RC, the fraction E t that is transmitted is 
equal to (1 —E t ). 

When the incident wave is reflected at the surface of a 
medium with higher seismic impedance ( Z 2 >Z X ), the 
reflection coefficient RC is positive. This means that the 
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reflected wave is in phase with the incident wave. How¬ 
ever, if the wave is incident on a medium with lower 
seismic impedance (Z n < Z x ), the reflection coefficient will 
be negative. This implies that the reflected wave is 180° out 
of phase with the incident wave. The fraction of energy 
reflected from an interface is equal to RC 2 , and therefore 
does not depend on whether the incidence is from the 
medium of higher or lower seismic impedance. 

3.65.4 Synthetic seismograms 

The travel-time of a reflection from a deep boundary in a 
multi-layered Earth is determined by the thicknesses and 
seismic velocities of the layers the seismic ray traverses. 
The amplitude of the recorded reflection is determined by 
the transmission and reflection coefficients for the subsur¬ 
face interfaces. If the densities and seismic velocities of 
subsurface layers are known (for example, from sonic and 
density logs in conveniently located boreholes), it is possi¬ 
ble to reconstruct what the seismogram should look like. 
Sometimes the density variations are ignored, and reflec¬ 
tion and transmission coefficients are calculated simply 
on the basis of seismic velocities. This approximation can 
often be very useful, for example in the exploration of the 
deep structure of the Earth's crust with the seismic reflec¬ 
tion method. The densities of deep layers are inaccessible 
directly, although they can be inferred from seismic veloc¬ 
ities. A vertical model of seismic velocities may be avail¬ 
able from a related refraction study. These data can be 
incorporated in a deep seismic reflection study to calcu¬ 
late a synthetic seismogram. The comparison of real and 
synthetic seismograms is useful for correlating reflection 
events and for separating real reflections from noise 
signals such as multiples. 

The principle is simple, but the construction is labori¬ 
ous. A vertically incident wave on the first boundary is 
resolved into reflected and transmitted components, with 
amplitudes corresponding to the seismic impedances 
above and below the interface. The transmitted wave is 
further subdivided at the next deeper interface into other 
reflected and transmitted components, and this is 
repeated at each subsequent boundary. Each wave is fol¬ 
lowed as it is reflected and refracted at subsurface inter¬ 
faces until it eventually returns to the surface. The 
theoretical record consists of the superposition of the 
numerous events, and represents the total travel-time and 
amplitude of each event. Whether in a high-resolution 
reflection study of near-surface sedimentary layering for 
petroleum exploration or in an analysis of deep crustal 
structure, the construction of a synthetic seismogram is 
an exacting chore that requires the use of fast modern 
computers. 

3.65.5 Seismic noise 

Controlled-source seismology allows fine resolution of a 
layered underground through analysis of the seismic 


travel-times. However, the seismic record contains not 
only primary reflections, or signals, from subsurface 
interfaces but also spurious secondary events, or noise, 
that interfere with the desired signals. The ratio of the 
energy in the signal to that in the noise, called the signal- 
to-noise ratio, is a measure of the quality of a seismic 
record. The higher the signal-to-noise ratio the better the 
record; records that have a ratio less than unity are 
unlikely to be usable. 

There are many ways in which seismic noise can be 
excited. They can be divided into incoherent (or random) 
noise and coherent noise. Incoherent noise is local in 
origin and is caused by shallow inhomogeneities such as 
boulders, roots or other non-uniformities that can scatter 
seismic waves locally. As a result of its local nature inco¬ 
herent noise is different on the records from adjacent geo¬ 
phones unless they are very close. In reflection seismology 
it is reduced by arranging the geophones in groups or 
arrays, of typically 16 geophones per group, and combin¬ 
ing the individual outputs to produce a single record. 
When n geophones form a group, this practice enhances 
the signal-to-noise ratio by the factor sin. 

Coherent noise is present on several adjacent traces of 
a seismogram. Two common forms result from surface 
waves and multiple reflections, respectively. 

A near-surface explosion excites surface waves (espe¬ 
cially Rayleigh waves) that can have strong amplitudes. 
They travel more slowly than P-waves but often reach the 
geophones together with the train of subsurface reflec¬ 
tions. The resultant “ground roll ” can obscure the reflec¬ 
tions, particularly if they are weak. The problem can be 
minimized by the geometry of the geophone layout (Fig. 
3.69a). For example, if the 16 geophones in a group are 
laid out at equal distances to cover a complete wavelength 
of the Rayleigh wave, the signals of individual geophones 
are effectively integrated to a low value. This procedure is 
only partially effective. Although most of the ground roll 
is related to Rayleigh waves, part is thought to have more 
complex near-surface origins. Another method of reduc¬ 
ing the effects of ground roll is frequency filtering. The 
frequency of the ground roll is often lower than that of 
the reflected P-waves, which allows attenuation of this 
type of coherent noise by including a high-pass filter in 
the geophone circuit or in the subsequent processing to 
cut out low frequencies. 

Multiple reflections are a very common source of 
coherent noise in a layered medium. They can originate in 
several ways, the most serious of which is the surface multi¬ 
ple (Fig. 3.69b). The reflection coefficient at the free 
surface of the Earth is high (in principle, RC = -1), and 
multiple reflections can occur between the surface and a 
reflecting interface. The travel-time for a first-order multi¬ 
ple at near-vertical incidence is double that of the primary 
signal. A copy of the reflector is observed on the seismo¬ 
gram at twice the real depth (or travel-time). Higher-order 
multiples produce additional apparent reflectors. A further 
advantage of using the common-mid-point method of 
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(a) "ground roll" 


wavelength X of ground motion 



(b) multiple reflection 



Fig. 3.69 Examples of seismic noise: (a) "ground roll" due to surface 
wave, and (b) multiple reflections between a reflector and the free 
surface. 

reflection profiling is that it is effective in attenuating the 
surface multiples. 

3.65.6 Reflection seismic section 

After migration, the cleaned records from all the 
seismometers are plotted side-by-side to form a sort of 
cross-section of the underground structure beneath the 
reflection profile. Strong reflectors can be followed across 
the section, and where they are interrupted, faults can be 
deduced. The top part of Fig. 3.70 shows the results of a 
113 km long, nearly north-south crustal reflection profile 
across western Lake Superior at the northern end of the 
North American Mid-Continent Rift System. This is an 
aborted Precambrian (age —1000 Ma) rift that has promi¬ 
nent gravity and magnetic expressions. The profile (Line 
C) was carried out with closely spaced shot-points using 
the common-mid-point method. The records were subse¬ 
quently stacked and migrated. The lower part of the 
figure shows the interpreted subsurface structures above a 
rather steeply dipping Moho, which under this profile 
increases in depth from about 32 km in the north to about 
50 km in the south. The section shows the presence of 
some inferred major faults, such as the southward 
dipping Douglas fault and the northward dipping 
Keweenawan fault, which appears to truncate other 
steeply dipping faults. 


3.6.6 Refraction seismology 

The method of seismic refraction can be understood by 
applying Huygens’ principle to the critical refraction at the 
interface between two layers. The seismic disturbance 
travels immediately below the interface with the higher 
velocity of the lower medium. It is called a head wave 
(or Mintrop wave, after the German seismologist who 
patented its use in seismic exploration in 1919). The upper 
and lower media are in contact at the interface and so the 
upper medium is forced to move in phase with the lower 
medium. The vibration excited at the boundary by the 
passage of the head wave acts as a moving source of sec¬ 
ondary waves in the upper layer. The secondary waves 
interfere constructively (in the same way as a reflected wave 
is formed) to build plane wavefronts; the ray paths return 
to the surface at the critical angle within the region of 
supercritical reflections (Fig. 3.71). The doubly refracted 
waves are especially important for the information they 
reveal about the layered structure of the deep interior of 
the Earth. 


3.6.6.1 Refraction at a horizontal interface 

The method of refraction seismology is illustrated for the 
case of the flat interface between two horizontal layers in 
Fig. 3.71. Let the depth to the interface be d and the 
seismic velocities of the upper and lower layers be V 1 and 
V 2 respectively (V l <V 2 ). The direct ray from the shot- 
point at S is recorded by a geophone G at distance x on the 
surface after time x/V j. The travel-time curve for the direct 
ray is a straight line through the origin with slope 
m l = 1/Fj. The hyperbolic t-x curve for the reflected ray 
intersects the time axis at the two-way vertical reflection 
(“echo”) time t 0 . At great distances from the shot-point 
the reflection hyperbola is asymptotic to the straight line 
for the direct ray. 

The doubly refracted ray travels along the path SC with 
the velocity V 1 of the upper layer, impinges with critical 
angle i c on the interface at C, passes along the segment 
CD with velocity V 2 of the lower layer, and returns to the 
surface along DG with velocity V 1 . The segments SC and 
DG are equal, CD = x - 2SA and the travel-time for the 
path SCDG can be written 


, 2SC , CD 

r r t v 2 


(3.121) 


i.e., 


t = 


2d , (*-2c/tan/ c ) 


Fi c °s; c 


F, 


(3.122) 


Rearranging terms and using Snell’s law, sin i c = V l IV 2 , 
we get for the travel-time of the doubly refracted ray 


x . 2d 

t- Vl + v x C0Si c 


(3.123) 
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Fig. 3.70 Results of deep 
reflection profiling across the 
North American Mid- 
continent Rift System under 
western Lake Superior. The 
top part of the figure shows 
the migrated reflection 
record, the bottom part the 
interpreted crustal structure 
(courtesy of A. G. Green). 





refracted ray and a reflection; the travel-time line for the 
head wave is tangential to the reflection hyperbola at x c . 
By backward extrapolation the refraction t-x curve is 
found to intersect the time axis at the intercept time t x , 
given by 

. _2 d . „ A v l~ v i ,,.... 

h y C0S, C yy (3.1-4) 

' 1 r l r 2 


Close to the shot-point the direct ray is the first to be 
recorded. However, the doubly refracted ray travels part 
of its path at the faster velocity of the lower layer, so that 
it eventually overtakes the direct ray and becomes the first 
arrival. The straight lines for the direct and doubly 
refracted rays cross each other at this distance, which is 
accordingly called the crossover distance, x a . It is com¬ 
puted by equating the travel-times for the direct and 
refracted rays: 


x x . „ , ^1 

V, v 2 Acl V, v 2 


= 2 d 




V 2 -V x 


(3.125) 

(3.126) 


Fig. 3.71 Travel-time versus distance curves for the direct ray and the 
reflected and refracted rays at a horizontal interface between two layers 
with seismic velocities l/, and V 2 (i/, > V 2 ). 

The equation represents a straight line with slope 
m 2 = 1 IV 2 . The doubly refracted rays are only recorded at 
distances greater than the critical distance x c . The first 
arrival recorded at x c can be regarded as both a doubly 


Refraction seismology gives the velocities of sub¬ 
surface layers directly from the reciprocal slopes of 
the straight lines corresponding to the direct and 
doubly refracted rays. Once these velocities have been 
determined it is possible to compute the depth d to the 
interface by using either the intercept time t t or the 
crossover distance x cr , which can be read directly from 
the t-x plot: 












184 Seismology and the internal structure of the Earth 


d = 


1, W 

2 V\ 


d = 


1 

2 X cr 


r 

V 


(3.127) 

(3.128) 


3.6.62 Refraction at an inclined interface 

In practice, the refracting interface is often not horizontal. 
The assumption of flat layers then leads to errors in the 
velocity and depth estimates. When the refractor is sus¬ 
pected to have a dip, the velocities of the beds and the dip 
of the interface can be obtained by shooting a second com¬ 
plementary profile in the opposite direction. Suppose a 
refractor dips at an angle 0 as in Fig. 3.72. Shot-points A 
and B are located at the ends of a geophone layout that 
covers AB. The ray ACDB from the shot-point A strikes 
the interface at the critical angle i c at C, runs as a head wave 
with velocity F, along the dipping interface, and the ray 
emerging at D eventually reaches a geophone at the end of 
the profile at B. During reverse shooting, the ray from the 
shot-point at B to a geophone at A traverses the same path 
in the reverse direction. However, the t-x curves are 
different for the up-dip and down-dip shots. Let d A and d B 
be the perpendicular distances from the shot-points A and 
B to the interface at P and Q, respectively. For the down- 
dip shot at A the travel-time to distance x is given by 

t d = AC y DB + (3.129) 



A x B 



Fig. 3.72 Travel-time versus distance curves of direct and refracted rays 
for up-dip and down-dip profiles when the refracting boundary dips at 
angle 0. 

where t id is the intercept time for the down-dip shot: 


The geometry of Fig. 3.72 gives the following trigono¬ 
metric relationships: 


2d A 

hd~ cosi 0 


(3.134) 


AC = - 


DB = - 


PC = c/ A tan /. 


cosz„ COSI c 

DQ = </ B tan/ c 
CD = xcosd — (PC + DQ) 

we have 

(d A + d B ) xcosd - (d A + d B ) tan/' c 
?d = Fjcosig + V 2 


xcosd . + 


+ 


- cos /„ 


(3.130) 


F 2 F, 

Equation (3.131) can be simplified by noting that 


(3.131) 


d B = d A + xsind 


and 


1 


sm;„ 


^2 v x 


(3.132) 


After substitution and gathering terms the down-dip 
travel-time is given by 

xsinz'cosd xcosLsind 2<LcosL 
r d Fj V, F 


1 


1 


= T/ sin (i + d) + t- 


(3.133) 


The analysis for shooting in the up-dip direction is analo¬ 
gous and gives 

Q= y t sin O'c ~ 0) + f lu (3.135) 

where f iu is the intercept time for the up-dip shot: 

2 d„ 

hu = y^ C0si c (3-136) 

If the upper layer is homogeneous, the segments for 
the direct ray will have equal slopes, the reciprocals of 
which give the velocity F, of the upper layer. The seg¬ 
ments of the t-x curves corresponding to the doubly 
refracted ray are different for up-dip and down-dip shoot¬ 
ing. The total travel-times in either direction along ACDB 
must be equal, but the t-x curves have different intercept 
times. As these are proportional to the perpendicular dis¬ 
tances to the refractor below the shot-points, the up-dip 
intercept time t m is larger than the down-dip intercept 
time t id . This means that the slope of the up-dip refraction 
in Fig. 3.72 is flatter than the down-dip slope. If we 
interpret the reciprocal of the slope as the velocity of the 
lower medium, we get two apparent velocities, V d and F u , 
given by 
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^ = -^sin(( c + 0) 
= sin (? c — 0) 


(3.137) 


Once the real velocity V 1 and the apparent velocities 
F d and F u have been determined from the t-x curves, the 
dip of the interface 9 and the critical angle i c (and from it 
the true velocity V 2 of the lower layer) can be computed: 


0 = 


sin 


-l 


(3.138) 


i c = 2 I sm 


-l 


sm 


(3.139) 


If the reciprocal apparent velocities (Eq. (3.137)) are 
added, a simple approximation for the true velocity of the 
lower layer is obtained: 

V + V = V (sin (i c + 0) + sin (;' c - 6)) 

y d v u y 1 
2 • . 

= ,, sin/,.cos0 
V-, c 


= -p-COS0 


(3.140) 



Fig. 3.73 (a) The path of a seismic wave through a horizontally layered 
medium, in which the seismic velocity is constant in each layer and 
increases with increasing depth, becomes ever flatter until critical 
refraction is reached; the return path of each emerging ray mirrors the 
incident path, (b) When the velocity increases continuously with depth, 
the ray is a smooth curve that is concave upward. 


If the refractor dip is small, cos 6 ~ 1 (for example, if 9 
<15°, cos 0 > 0.96) and an approximate formula for the 
true velocity of the second layer is 


V 2 


v A 


(3.141) 


3.6.63 Refraction with continuous change of velocity with 
depth 

Imagine the Earth to have a multi-layered structure with 
numerous thin horizontal layers, each characterized by a 
constant seismic velocity, which increases progressively 
with increasing depth (Fig. 3.73). A seismic ray that 
leaves the surface with angle i 1 will be refracted at each 
interface until it is finally refracted critically. The ray that 
finally returns to the surface will have an emergence angle 
equal to t\. Snell's law applies to each successive refrac¬ 
tion (e.g., at the top surface of the nth layer, which has 
velocity VJ 


the velocity increases constantly with increasing depth. 
Each ray then has a smoothly curved path. If the vertical 
increase in velocity is linear with depth, the curved rays 
are circular arcs. 

In the above we have assumed that the refracting inter¬ 
faces are horizontal. This type of analysis is common in 
seismic prospecting, where only local structures and com¬ 
paratively shallow depths are evaluated. The passage of 
seismic body waves through a layered spherical Earth can 
be treated to a first approximation in the same way. We can 
represent the vertical (radial) velocity structure by subdi¬ 
viding the Earth into concentric shells, each with a faster 
body-wave velocity than the shell above it (Fig. 3.74). 
Snell’s law of refraction applies to the interface between 
each pair of shells. For example, at point A we can write 


sin/j sinaj 


(3.143) 


Multiplying both sides by r l gives 


sin i 1 sin i 2 

~rr = ~ff 


Smi n 

V. 


= constant = p 


rjsin/j rjsinaj 

( 3 . 142 ) 


(3.144) 


The constant p is called the ray parameter. It is charac¬ 
teristic for a particular ray with emergence angle /, and 
velocity V x in the surface layer. If V m is the velocity of the 
deepest layer, along whose surface the ray is eventually 
critically refracted (sin ; m = 1), then the value of p must be 
equal to 1/F m . 

As the number of layers increases and the thickness of 
each layer decreases, the situation is approached in which 


In triangles ACD and BCD, respectively, we have 
d= CjSinflj = jysina-, (3.145) 

Combining Eqs. (3.143), (3.144) and (3.145) gives the 
result 


rjSin/j r,sm/ 2 


r„ sm i„ 


V , 


F, 


V„ 


-=P 


(3.146) 
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Fig. 3.74 Refraction of a seismic ray in a spherically layered Earth, in 
which the seismic velocity is constant in each layer and the layer-velocity 
increases with depth. 


The constant p is again called the ray parameter, 
although it has different dimensions than in Eq. (3.142) 
for flat horizontal layers. Here the seismic ray is a straight 
line within each spherical layer with constant velocity. If 
the velocity increases continuously with depth, the 
seismic ray is refracted continuously and its shape is 
curved concavely upward. It reaches its deepest point 
when sin z = 1, at radius r Q where the velocity is V 0 ; these 
parameters are related by the Benndorf relationship: 


rsin; r o 

-r=v 0 =r 


(3.147) 


Determination of the ray parameter is the key to 
determining the variation of seismic velocity inside the 
Earth. Access to the Earth’s interior is provided by analy¬ 
sis of the travel-times of seismic waves that have traversed 
the various internal regions and emerge at the surface, 
where they are recorded. We will see in Section 3.7.3.1 
that the travel-time (t) of a seismic ray to a known epicen- 
tral distance (A) can be mathematically inverted to give 
the velocity V 0 at the deepest point of the path. The 
theory applies for P- and S-waves, the general velocity V 
being replaced by the appropriate velocity a or / 3 , respec¬ 
tively. 


3.7 INTERNAL STRUCTURE OF THE EARTH 

3.7.1 Introduction 

It is well known that the Earth has a molten core. What is 
now general knowledge was slow to develop. In order to 
explain the existence of volcanoes, some nineteenth 
century scientists postulated that the Earth must consist 
of a rigid outer crust around a molten interior. It was also 
known in the last century that the mean density of the 
Earth is about 5.5 times that of water. This is much larger 
than the known specific density of surface rocks, which is 
about 2.5-3. From this it was inferred that density 


increased towards the Earth’s center under the effect of 
gravitational pressure. The density at the Earth’s center 
was estimated to be comparatively high, greater than 7000 
kg m -3 and probably in the range 10,000-12,000 kg m“ 3 . 
It was known that some meteorites had a rock-like com¬ 
position, while others were much denser, composed 
largely of iron. In 1897 E. Wiechert, who subsequently 
became a renowned German seismologist, suggested that 
the interior of the Earth might consist of a dense metallic 
core, cloaked in a rocky outer cover. He called this cloak 
the ‘’Mantel,” which later became anglicized to mantle. 

The key to modern understanding of the interior of 
the Earth - its density, pressure and elasticity - was pro¬ 
vided by the invention of the Milne seismograph. The 
progressive refinement of this instrument and its system¬ 
atic employment world-wide led to the rapid development 
of the modern science of seismology. Important results 
were obtained early in the twentieth century. The Earth’s 
fluid core was first detected seismologically in 1906 by R. 
D. Oldham. He observed that, if the travel-times of P- 
waves observed at epicentral distances of less than 100° 
were extrapolated to greater distances, the expected 
travel-times were less than those observed. This meant 
that the P-waves arriving at large epicentral distances 
were delayed in their passage through the Earth. Oldham 
inferred from this the existence of a central core in which 
the P-wave velocity was reduced. He predicted that there 
would be a region of epicentral distances (a “shadow 
zone”) in which P-waves could not arrive. About this time 
it was found that P- and S-waves passed through the 
mantle but that no S-waves arrived beyond an epicentral 
distance of 105°. In 1914, B. Gutenberg verified the exis¬ 
tence of a shadow zone for P-waves in the range of epi¬ 
central distances between 105° and 143°. Gutenberg also 
located the depth of the core-mantle boundary with 
impressive accuracy at about 2900 km. A modern esti¬ 
mate of the radius of the core is 3485 ± 3 km, giving a 
mantle 2885 km thick. Gutenberg also predicted that P- 
waves and S-waves would be reflected from the 
core-mantle boundary. These waves, known today as PcP 
and ScS waves, were not observed until many years later. 
In honor of Gutenberg the core-mantle boundary is 
known today as the Gutenberg seismic discontinuity. 

While studying the P-wave arrivals from an earth¬ 
quake in Croatia in 1909 Andrija Mohorovigiq found only 
a single arrival (P g ) at distances close to the epicenter. 
Beyond about 200 km there were two arrivals; the P g 
event was overtaken by another arrival (P n ) which had 
evidently travelled at higher speed. Mohoroviqiq identi¬ 
fied P g as the direct wave from the earthquake and P n as a 
doubly refracted wave (equivalent to a head wave) that 
travelled partly in the upper mantle. Mohoroviqiq calcu¬ 
lated velocities of 5.6 km s -1 for P„ and 7.9 km s -1 for P n 
and estimated that the sudden velocity increase occurred 
at a depth of 54 km. This seismic discontinuity is now 
called the Mohorovigig discontinuity, or Moho for short. It 
represents the boundary between the crust and mantle. 
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Fig. 3.75 A SW-NE refraction 
seismic profile parallel to the 
strike of the Swiss Alps. The 
seismograms from adjacent 
stations have been modified 
to show reduced travel-times 
as a function of distance from 
the shot-point. The layered 
subsurface structure is traced 
by connecting main arrivals 
such as the crustal direct wave 

P , mantle head wave P , and 
g n 

mantle reflection PmP (after 

Maurer and Ansorge, 1992). 



The crustal thickness is known to be very variable. It aver¬ 
ages about 33 km, but measures as little as 5 km under 
oceans and as much as 60-80 km under some mountain 
ranges. 

The seismological Moho is commonly defined as the 
depth at which the P-wave velocity exceeds 7.6 km s -1 . 
This seismic definition is dependent on the density and 
elastic properties of crustal and mantle rocks, and need 
not correspond precisely to a change of rock type. An 
alternative definition of the Moho as the depth where the 
rock types change is called the petrological Moho. For 
most purposes the two definitions of the crust-mantle 
boundary are equivalent. 

It is now known that the crust is not homogeneous but 
has a vertically layered structure. In 1925 V. Conrad sepa¬ 
rated arrivals from a Tauern (Eastern Alps) earthquake of 
1923 into P g and S g waves in an upper crustal layer and 
faster P* and S* waves that travelled with velocities 6.29 
km s -1 and 3.57 km s _1 , respectively, in a deeper layer. 
Because the P* and S* velocities are significantly slower 
than corresponding upper mantle velocities, Conrad 
deduced that they were head waves from a lower crustal 
layer. The interface separating the continental crust into 
an upper crustal layer and a lower crustal layer is called 
the Conrad discontinuity. Influenced by early petrological 
models of crustal composition and by comparison with 
seismic velocities in known materials, seismologists 
referred to the upper and lower crustal layers as the 
granitic layer and the basaltic layer, respectively. This 
petrological separation is now known to be overly sim¬ 
plistic. In contrast to the Moho, which is found every¬ 
where, the Conrad discontinuity is poorly defined or 
absent in some areas. 

The appearance of some of these arrivals on refraction 
seismic records from the European continental crust is 
illustrated in Fig. 3.75. The vertical axis in this figure 
shows a reduced travel-time, obtained by dividing the 
shot-point to receiver distance by a representative crustal 
velocity (6 km s -1 in this case) and subtracting this from 
the observed travel-time. This method of displaying the 
data prevents the plot from becoming unwieldy at large 
distances. The crustal direct wave P g is represented as a 
nearly horizontal arrival, PmP is a P-wave reflected from 


the Moho, and P n is the upper mantle head wave along 
the Moho. Additional intracrustal reflections are labeled 
P2P, P3P, etc. 

Similar designations are used for events in seismic sec¬ 
tions of oceanic crust. The oceanic crustal layers consist 
of sedimentary Layer 1, basaltic Layer 2, and gabbroic 
Layer 3 (Section 3.7.5.1). The direct wave in Layer 2 is 
called P,,, the head wave at the basalt-gabbro interface is 
referred to as P 3 , and the Moho head wave is P n . 

The core shadow-zone and its interpretation in terms 
of a fluid core were well established in 1936 when Inge 
Lehmann, a Danish seismologist, reported weak P-wave 
arrivals within the shadow zone. She interpreted these in 
terms of an inner core with higher seismic velocity. 
However, the existence of the inner core remained contro¬ 
versial for many years. Improved seismometer design, 
digital signal treatment and the setting up of seismic 
arrays have provided corroborating evidence. The exis¬ 
tence of a solid inner core is also supported by analyses of 
the Earth’s natural vibrations. 

The gross internal structure of the Earth is modelled as 
a set of concentric shells, corresponding to the inner core, 
outer core and mantle (Fig. 3.76). An important step in 
understanding this layered structure has been the develop¬ 
ment of travel-time curves for seismic rays that pass 
through the different shells. To facilitate identification of 
the arrivals of these rays on seismograms, a convenient 
shorthand notation is used. A P- or S-wave that travels 
from an earthquake directly to a seismometer is labelled 
with the appropriate letter P or S; until the margin of the 
core shadow-zone, the P- and S-waves follow identical 
curved paths. (The curvature, as explained in Section 
3.6.6.3, arises from the increase in seismic velocity with 
depth.) A wave that reaches the seismometer after being 
reflected once from the crust is labelled PP (or SS), as its 
path consists of two identical P- or S-segments. 

The energy of an incident P- or S-wave is partitioned 
at an interface into reflected and refracted P- and S-waves 
(see Section 3.6.4). A P-wave incident on the boundary 
between mantle and fluid outer core is refracted towards 
the normal to the interface, because the P-wave velocity 
drops from about 13 km s -1 to about 8 km s -1 at the 
boundary. After a second refraction it emerges beyond 
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Fig. 3.76 Seismic wave paths of some important refracted and 
reflected P-wave and S-wave phases from an earthquake with focus at 
the Earth's surface. 

the shadow zone and is called a PKP wave (the letter K 
stands for Kern, the German word for core). An S-wave 
incident at the same point has a lower mantle velocity of 
about 7 km s -1 . Part of the incident energy is converted to 
a P-wave in the outer core, which has a higher velocity of 
8 km s _1 . The refraction is away from the normal to the 
interface. After a further refraction the incident S-wave 
reaches the surface as an SKS phase. A P-wave that 
travels through mantle, fluid core and inner core is 
labelled PKIKP. Each of these rays is refracted at an 
internal interface. To indicate seismic phases that are 
reflected at the outer core boundary the letter c is used, 
giving rise, for example, to PcP and ScS phases (Fig. 
3.76). Reflections from the inner core are designated with 
the letter i, as for example in the phase PKiKP. 

3.7.2 Refractions and reflections in the Earth's interior 

If it possesses sufficient energy, a seismic disturbance may 
be refracted and reflected - or converted from a P-wave to 
an S-wave, or vice versa - many times at the several 
seismic discontinuities within the Earth and at its free 
surface. As a result, the seismogram of a large earthquake 
contains numerous overlapping seismic signals and the 
identification of individual phases is a difficult task. Late- 
arriving phases that have been multiply reflected or that 
have travelled through several regions of the Earth’s inte¬ 
rior are difficult to resolve from the disturbance caused by 
earlier arrivals. In the period 1932-1939 H. Jeffreys and 
K. E. Bullen analyzed a large number of good records of 
earthquakes registered at a world-wide, though sparse, 
distribution of seismic stations. In 1940 they published a 
set of tables giving the travel-times of P- and S-waves 
through the Earth. A slightly different set of tables was 
reported by B. Gutenberg and C. F. Richter. The good 



Fig. 3.77 Travel-time versus epicentral distance (f-A) curves for some 
important seismic phases (modified from Jeffreys and Bullen, 1940). 

agreement of the independent analyses confirmed the 
reliability of the results. The Jeffreys-Bullen seismologi- 
cal tables were used by the international seismic commu¬ 
nity as the standard of reference for many years. 

The travel-time of a seismic wave to a given epicentral 
distance is affected by the focal depth of the earthquake, 
which may be as much as several hundred kilometers. 
The travel-time versus distance curves of some impor¬ 
tant phases are shown in Fig. 3.77 for an earthquake 
occurring at the Earth’s surface. The model assumes that 
the Earth is spherically symmetric, with the same verti¬ 
cal structure underneath each place on the surface. This 
assumption works fairly well, although it is not quite 
true. Lateral variations of seismic velocity have been 
found at many depths within the Earth. For example, 
there are lateral differences in seismic velocity between 
oceanic and continental crust, and between oceanic and 
continental lithosphere. At even greater depths signifi¬ 
cant lateral departures from the spherical model have 
been detected. These discrepancies form the basis of the 
branch of seismology called seismic tomography, which 
we will examine later. 

3.7.2.1 Seismic rays in a uniformly layered Earth 

It is important to understand clearly the relationship 
between the travel-time (?) versus epicentral distance (A) 
curves and the paths of seismic waves in the Earth, like 
those shown in Fig. 3.76. Consider first an Earth that 
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Fig. 3.78 Seismic wave paths 
and their f-A curves for P- 
waves passing through a 
spherical Earth with constant 
velocities in the mantle, outer 
core and inner core, 
respectively, (a) Development 
of a shadow zone when the 
mantle velocity (I/,) is higher 
than the outer core velocity 
(V 2 ). (b) Penetration of the 
shadow zone by rays 
refracted in an inner core with 
higher velocity than the outer 
core (V 3 >V 2 ). 






consists of two concentric shells representing the mantle 
and core (Fig. 3.78a). The P-wave velocity in each shell is 
constant, and is faster in the mantle than in the core (V l > 
V 2 ). The figure shows the paths of 18 rays that leave a 
surface source at angular intervals of 5°. Rays 1-12 travel 
directly through the mantle as P-waves and emerge at 
progressively greater epicentral distances. The convex 
upwards shape of the t -A curve is here due to the curved 
outer surface, the layer velocity being constant. Ray 13 is 
partially refracted into the core, and partially reflected 
(not shown in the figure). Because V 2 < V 1 the refracted 
ray is bent towards the Earth’s radius, which is normal to 
the refracting interface. This ray is further bent on leaving 
the core and reaches the Earth’s surface as a PKP phase at 
an epicentral distance greater than 180°. Rays 14 and 15 
impinge more directly on the core and are refracted less 
severely; their epicentral distances become successively 
smaller and their travel-times become shorter than that of 
ray 13, as indicated by branch 13-15 of the f-A curve. 
This branch is offset in time from the extrapolation of 
branch 1-12 because of the lower velocity in the core. The 
paths of rays 16, 17 and 18 (which is a straight line 
through mantle and core and emerges at an epicentral dis¬ 
tance of 180°) become progressively longer, and a second 
branch 16-18 develops on the t -A curve. The two 
branches meet at a sharp point, or cusp. No P-waves 
reach the surface in the gap between rays 12 and 15 in this 
simple model. There is a shadow zone between the last P- 
wave that just touches the core and the PKP-wave with 
the smallest epicentral distance. The existence of a 
shadow zone for P-waves is evidence for a core with lower 
P-wave velocities than the mantle. S-waves in the mantle 
follow the same ray paths as the P-waves. However, no 
direct S-waves arrive in the shadow zone, which indicates 
that the core must be fluid. 

Now suppose an inner core with a constant velocity 
F 3 that is higher than the velocity V 2 in the outer core 


(Fig. 3.78b). The paths of rays 1-15 are the same as 
before, through the mantle and outer core. The segments 
1-12 and 13-15 of the f-A curve are the same as previ¬ 
ously. Ray 16 impinges on the inner core and is sharply 
refracted away from the Earth’s radius; on returning to 
the outer core it is again refracted, back towards the 
radius. After further refraction at the core-mantle inter¬ 
face this ray emerges at the Earth's surface at a smaller 
epicentral distance than ray 15, within the P-wave 
shadow-zone, as a PKIKP event. Successive PKIKP rays 
are bent less strongly. The PKIKP rays map out a new 
branch 16-18 of the f-A curve (see also Fig. 3.77). 

3.72.2 Travel-time curves for P-, PKP- and PKIKP-waves 

In general the velocities of P-waves and S-waves increase 
with depth. As described in Section 3.6.6.3 and illustrated 
in Fig. 3.73a, the ray paths are curved lines, concave 
towards the surface. However, the explanation of the 
paths of P, PKP, and PKIKP phases shown in Fig. 3.79 
closely follows the preceding discussion. There is a 
shadow zone for direct P-waves between about 103° and 
143°, and no direct S-waves are found beyond 103°. The 
shallowest PKP ray (A in Fig. 3.79) is deviated the fur¬ 
thest, emerging at an epicentral distance greater than 
180°. Successively deeper PKP rays (B-E) emerge at ever- 
smaller epicentral distances until about 143°, after which 
the epicentral distance increases again to almost 170° 
(rays F, G). It was long believed that the boundary 
between inner and outer core was a transitional region 
(called region F in standard Earth models) with higher P- 
wave velocity and that PKP rays traversing this region 
would again emerge at smaller epicentral distances 
(ray H). The first rays penetrating the inner core are 
sharply refracted and emerge in the P-wave shadow zone. 
The most strongly deviated (ray I) is observed at an epi¬ 
central distance of about 110°; deeper rays (J-P) arrive at 
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Fig. 3.79 The wave paths of 
some P, PKP, and PKIKP rays 
(after Gutenberg, 1959). 



ever-greater distances up to 180°. There are at least two 
branches of the t -A curve for A> 143°, corresponding to 
the PKP and PKIKP phases, respectively (Fig. 3.77). In 
fact, depending how the transitional region F is modelled, 
the ?-A curve near 143° can have several branches. 

The edges of the shadow zone defined by P and PKP 
phases are not sharp. One reason is the intrusion of 
PKIKP phases at the 143° edge. Another is the effect of 
diffraction of P-waves at the 103° edge (Fig. 3.79). The 
bending of plane waves at an edge into the shadow of an 
obstruction was described in Section 3.6.2.3, and 
explained with the aid of Huygen’s principle. The 
diffraction of plane waves is called Fraunhofer diffraction. 
When their source is not at infinity, waves must be 
handled as spherical waves. Spherical wavefronts that 
pass an obstacle are also diffracted. This type of behavior 
is called Fresnel diffraction, and it is also explainable with 
Huygen’s principle as the product of interference between 
the primary wavefront and secondary waves generated at 
the obstacle. Wave energy penetrates into the shadow 
of the obstacle, as though the wavefront were bent around 
the edge. In this way very deep P-waves are diffracted 
around the core and into the shadow zone. The intensity 
of the diffracted rays falls off with increasing angular dis¬ 
tance from the diffracting edge, in this case the core¬ 
mantle boundary. Modern instrumentation enables 
detection of long-period diffracted P-waves to large epi- 
central distances (Figs. 3.77, 3.79). The velocity structure 
above the core-mantle boundary, in particular in the D"- 
layer (Section 3.7.5.3), has a strong influence on the ray 
paths, travel-times and waveforms of the diffracted waves. 

3.7.3 Radial variations of seismic velocities 

Models of the radial variations of physical parameters 
inside the Earth implicitly assume spherical symmetry. 
They are therefore “average” models of the Earth that do 


not take into account lateral variations (e.g., of velocity 
or density) at the same depth. This is a necessary first step 
in approaching the true distributions as long as lateral 
variations are relatively small. This appears to be the case; 
although geophysically significant, the lateral variations 
in physical properties remain within a few percent of the 
average value at any depth. 

There are two main ways to determine the distribu¬ 
tions of body-wave velocities in a spherically symmetric 
Earth. They are referred to as forward and inverse model¬ 
ling. Both methods have to employ the same sets of obser¬ 
vations, which are the travel-times of different seismic 
phases to known epicentral distances. The forward tech¬ 
nique starts with a known or assumed variation of 
seismic velocities and calculates the corresponding travel- 
times. The inversion method starts with the observed t —A 
curves and computes a model of the velocity distributions 
that could produce the curves. The inversion method is 
the older one, in use since the early part of the twentieth 
century, and forms an important branch of mathematical 
theory. Forward modelling is a more recent method that 
has been successfully employed since the advent of pow¬ 
erful computers. 

3.7.3.1 Inversion of travel-time versus distance curves 

In 1907 the German geophysicist E. Wiechert, building 
upon an evaluation of the Benndorf problem (Eq. 
(3.147)) by the mathematician G. Herglotz, developed 
an analytical method for computing the internal distrib¬ 
utions of seismic velocities from observations made at 
the Earth's surface. The technique is called inversion of 
travel-times, and it is considered one of the classical 
methods of geophysics. The observational data consist 
of the t —A curves for important seismic phases 
(Fig. 3.77). The clues to deciphering the velocity dis¬ 
tributions were the Benndorf relationship for the ray 
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Focus 



Fig. 3.80 Paths of two rays that leave an earthquake focus at 
infinitesimally different angles, reaching the surface at points P and P' at 
epicentral distances A and A + dA, respectively. 


The Herglotz-Wiechert inversion is valid for regions of 
the Earth in which p varies monotonically with A. It 
cannot be used in the Earth's crust, because conditions are 
too inhomogeneous. Seismic velocity distributions in the 
crust are deduced empirically from long seismic refraction 
profiles. The Elerglotz-Wiechert method cannot be used 
where a low-velocity zone is present, because the ray does 
not bottom in the zone. It works well for the Earth’s 
mantle, but care must be taken where a seismic discontinu¬ 
ity is present. In the Earth's core the refraction of P-waves 
at the core-mantle boundary means that no PKP waves 
reach their deepest point in the outer layers of the core 
(Fig. 3.79). However, SKS-waves bottom in the outer core. 
Inversion of SKS-wave travel-times complements the 
inversion of PKP-wave data to give the P-wave velocity 
distribution in the core. 


parameter p (Eq. (3.147)) and the recognition that the 
value of p can be obtained from the slope of the travel¬ 
time curve at the epicentral distance where the ray 
returns to the surface. 

Consider two rays that leave an earthquake focus at 
infinitesimally different angles, reaching the surface at 
points P and P' at epicentral distances A and A + dA, 
respectively (Fig. 3.80). The distance PP' is R dA (where R 
is the Earth’s radius) and the difference in arc-distances 
along the adjacent rays is equal to V d t, where V is the 
velocity in the surface layer and dr is the difference in 
travel-times of the two rays. In the small triangle PP'Q the 
angle QPP' is equal to i, the angle of emergence (inci¬ 
dence). Therefore, 


V U( 

Sln 1 ~ RdA 

R sin; _ _ dp 
V — ^ — dA 


(3.148) 

(3.149) 


This means that the value of p for the ray emerging at 
epicentral distance A can be obtained by calculating the 
slope (dr/dA) of the travel-time curve for that distance. 
This is an important step in finding the velocity V 0 at the 
deepest point of the ray, at radius r () , because F 0 = r Q /p. 
However, before we can find the velocity we need to know 
the value of r () . The continuation of the analysis - known 
as Herglotz-Wiechert inversion - is an intricate mathe¬ 
matical procedure, beyond the scope of this text, which 
fortunately results in a fairly simple formula: 


3.73.2 Forward modelling: polynomial parametrization 

The forward modelling method starts with a presupposed 
dependence of seismic velocity with depth. The method 
assumes that the variation of velocity can be expressed by 
a smooth polynomial function of radial distance within 
limited depth ranges. This procedure is called polynomial 
parametrization and in constructing models of the 
Earth’s interior it is applied to the P-wave and S-wave 
velocities, the seismic attenuation, and the density. 

The travel-times of P- and S-waves to any epicentral 
distance are calculated on the basis of the spherically 
symmetric, layered model. The computed travel-times are 
compared with the observed t -A curves, and the model is 
adjusted to account for differences. The procedure is 
repeated as often as necessary until an acceptable agree¬ 
ment between the computed and real travel-times is 
achieved. The method requires good travel-time data for 
many seismic phases and involves intensive computation. 

In 1981 A. M. Dziewonski and D. L. Anderson con¬ 
structed a Preliminary Reference Earth Model (acronym: 
PREM) in which the distributions of body-wave veloci¬ 
ties in important layers of the Earth were represented by 
cubic or quadratic polynomials of normalized radial dis¬ 
tance; in thin layers of the upper mantle linear relation¬ 
ships were used. A similar, revised parametrized velocity 
model ( iasp91 ) was proposed by B. L. N. Kennett and E. 
R. Engdahl in 1991. The variations of P- and S-wave 
velocities with depth in the Earth according to the iasp91 
model are shown in Fig. 3.81. 


l% = ^jcosh-i(^)dA (3.150) 

where p(Aj) is the slope of the /-A curve at A p the epicen¬ 
tral distance of emergence, and p(A) is the slope at any 
intermediate epicentral distance A. Equation (3.150) can 
be used to integrate numerically along the ray to give the 
value of r 0 for the ray, and V Q = r Q ip. 


3.7.4 Radial variations of density, gravity and pressure 

In order to determine density, gravity and pressure in the 
Earth’s interior several simplifying assumptions must be 
made, which appear to be warranted by experience. The 
Earth is assumed to be spherically symmetric and com¬ 
posed of concentric homogeneous shells or layers (e.g., 
inner core, outer core, mantle, etc.). Possible effects of 
chemical and phase changes within a shell are not taken 
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Fig. 3.81 The variations with depth of longitudinal- and shear-wave 
velocities, a and b, respectively, in the Earth's interior, according to the 
Earth model i asp91 (data source: Kennett and Engdahl, 1991). 


into account. Pressure and density are assumed to increase 
purely hydrostatically. If the distributions of seismic 
body-wave velocities a and /3 are known, an important 
seismic parameter can be defined: 

<P=a 2 -|8 2 (3.151) 

By comparing Eq. (3.151) and Eq. (3.48) we see that d> 
is equal to Kip , where K is the bulk modulus and p the 
density. 


3.7.4.1 Density inside the Earth 

Consider a vertical prism between depths z and dz (Fig. 
3.82). The hydrostatic pressure increases from p at depth z 
to ip + dp) at depth (z + dz) because of the extra pressure 
due to the extra material in the small prism of height dz. 
The pressure increase dp is equal to the weight w of the 
prism divided by the area A of the base of the prism, over 
which the weight is distributed. 

w (volume X p)g (Ad:p)g 

d P = A = - A - = ~ 4~ 

— pgd: = — pgdr (3.152) 


From the definition of bulk modulus, K (Eq. (3.17)) we 
can write 


K= - V 


dp 

dV 



(3.153) 


Combining Eqs. (3.151), (3.152) and (3.153) gives 


~ d P _ P _ 1 
pgdr K <f> 


(3.154) 


V 



: area : 
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base 
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Fig. 3.82 Computation of hydrostatic pressure in the Earth, assuming 
that a change in pressure dp with depth increase dz is due only to the 
increase in weight of overlying material. 


dp = P(r)gjr) 
dr 4>(r) 


(3.155) 


Equation (3.155) is known as the Adams-Williamson 
equation. It was first applied to the estimation of density in 
the Earth in 1923 by E. D. Williamson and L. H. Adams. It 
yields the density gradient at radius r, when the quantities 
on the right-hand side are known. The seismic parameter 
<f> is known accurately, but the density p is unknown; it is in 
fact the goal of the calculation. The value of gravity g used 
in the equation must be computed separately for radius r. 
It is due only to the mass contained within the sphere of 
radius r, because external (homogeneous) shells of the 
Earth do not contribute to gravitation inside them. This 
mass is the total mass E of the Earth minus the cumulative 
mass of all spherical shells external to r. 

The procedure requires that a starting value for p be 
assumed at a known depth. Analyses of crustal and upper 
mantle structure in isostatically balanced areas give esti¬ 
mates of upper mantle density around 3300 kg m -3 . 
Using this as a starting value at an initial radius r l the 
density gradient in the uppermost mantle can be calcu¬ 
lated from Eq. (3.155). Linear extrapolation of this gradi¬ 
ent to a chosen greater depth gives the density p at radius 
ry the corresponding new value of g can be calculated by 
subtracting the mass of the shell between the two depths; 
together with the value of 4>(r 2 ) the density gradient can 
be computed at r 2 . This iterative type of calculation gives 
the variation of density with depth (or radius). Fine steps 
in the extrapolations give a smooth density distribution 
(Fig. 3.83). There are two important boundary conditions 
on the computed density distribution. Integrated over the 
Earth’s radius it must give the correct total mass 
(E= 5.974 XlO 24 kg). It must also fulfil the relationship 
C = 0.3308£'T? 2 between Earth’s moment of inertia (C), 
mass (E) and radius ( R ), as explained in Section 2.4.3. 

The density changes abruptly at the major seismic 
discontinuities (Fig. 3.83), showing that it is affected 
principally by changes in composition. If it could be mea- 
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Fig. 3.83 Radial distribution of density within the Earth according to 
Earth model PREM (data source: Dziewonski, 1989). 

sured at normal sea-level pressure and temperature, the 
density would be found to be around 4200 kg m” 3 in the 
mantle, 7600 kg m -3 in the outer core and 8000 kg m -3 in 
the inner core. The smooth increase in density between 
the major compositional discontinuities is the result of 
the increases in pressure and temperature with depth. 

3.74.2 Gravity and pressure inside the Earth 

The radial variation of gravity can be computed from the 
density distribution. As stated above, the value of g(r) is 
due only to the mass m(r) contained within the sphere of 
radius r. Let the density at radius x (< r) be p(x); the 
gravity at radius r is then given by 

g(r) = — G ^ ~ ~ ^ J4ttx 2 p(x) dx (3.156) 

A remarkable feature of the internal gravity (Fig. 3.84) 
is that it maintains a value close to 10 m s -2 throughout 
the mantle, rising from 9.8 m s -2 at the surface to 
10.8m s -2 at the core-mantle boundary. It then decreases 
almost linearly to zero at the Earth's center. 

Flydrostatic pressure is due to the force (N) per unit 
area (m 2 ) exerted by overlying material. The SI unit of 
pressure is the pascal (1 Pa = 1 N m -2 ). In practice this is a 
small unit. The high pressures in the Earth are commonly 
quoted in units of gigapascal (1 GPa = 10 9 Pa), or alter¬ 
natively in kilobars or megabars (1 bar = 10 5 Pa; 1 kbar = 
10 8 Pa; 1 Mbar = 10 11 Pa = 100 GPa). 

Within the Earth the hydrostatic pressure p(r ) at radius 
r is due to the weight of the overlying Earth layers 
between r and the Earth's surface. It can be computed by 
integrating Eq. (3.152) using the derived distributions of 
density and gravity. This gives 
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Fig. 3.84 Radial variations of internal gravity (thick curve) and pressure 
(thin curve) according to Earth model PREM (data source: Dziewonski, 
1989). 


P O’) = Jp(r)g(r) dr (3.157) 

r 

The pressure increases continuously with increasing 
depth in the Earth (Fig. 3.84). The rate of increase (pres¬ 
sure gradient) changes at the depths of the major seismic 
discontinuities. The pressure reaches a value close to 380 
GPa (3.8 Mbar) at the center of the Earth, which is about 
4 million times atmospheric pressure at sea-level. 

3.7.5 Models of the Earth's internal structure 

Once the velocity distributions of P- and S-waves inside 
the Earth were known the broad internal structure of the 
Earth - crust, mantle, inner and outer core - could be 
further refined. In 1940-1942 K. E. Bullen developed a 
model of the internal structure consisting of seven con¬ 
centric shells. The boundaries between adjacent shells 
were located at sharp changes in the body-wave veloci¬ 
ties or the velocity gradients. For ease of identification 
the layers were labelled A-G (Table 3.4); this nomencla¬ 
ture has been carried over into more modern models. 
The seismic layering of the Earth is better known 
than the composition of the layers, which must be 
inferred from laboratory experiments and petrological 
modelling. 

In the original Model A the density distribution was 
not well constrained. Two different density distributions 
(A and A') that fitted the known mass and moment 
of inertia gave disparate central densities of 17,300 
kg m -3 and 12,300 kg m~ 3 , respectively. In 1950 Bullen 
presented Earth Model B, in which the bulk modulus 
(K) and seismic parameter (<t>) were assumed to vary 
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Table 3.4 Comparison of Earth’s internal divisions according to Model A (Bullen, 1942) and PREM (Dziewonski and 
Anderson, 1981) 




PREM 


Model A 

Region [km] 

Layer 

Depth range 
[km] 

Comments 

A (0-33) 

crust: 

upper 

lower 

0-15 

15-24 

lithosphere 0-80 km 

B (33M40) 

upper mantle: 
uppermost mantle 

24-80 



low-velocity layer 

80-220 

asthenosphere 


transition zone 

220M00 


C(410-1000) 

upper mantle: 
transition zones 

400-670 

670-770 


D(1000-2900) 

lower mantle: 
layer D' 

770-2740 



layer D" 

2740-2890 


E (2900M980) 

outer core 

2890-5150 


F (4980-5120) 

transition layer 



G(5120-6370) 

inner core 

5150-6370 



smoothly with pressure below a depth of 1000 km. 
The model suggested a central density around 18,000 
kg m -3 . 

In the 1950s the development of long-period seismo¬ 
graphs made possible the observation of the natural oscil¬ 
lations of the Earth. After very large earthquakes 
numerous modes of free oscillation are excited with 
periods up to about one hour (Section 3.3.4). These were 
first observed unambiguously after the huge Chilean 
earthquake of 1960. The free oscillations form an inde¬ 
pendent constraint on Earth models. The lowest- 
frequency spheroidal modes involve radial displacements 
that take place against the restoring force of gravitation 
and are therefore affected by the density distribution. 
Starting from a spherically symmetric Earth model with 
known distributions of density and elastic properties, the 
forward problem consists of calculating how such a model 
will reverberate. The calculated and observed normal 
modes of oscillation are compared and the model is 
adjusted until the required fit is obtained. The inverse 
problem consists of computing the model of density and 
elastic properties by inverting the frequency spectrum of 
the free oscillations. The parametrized model PREM, 
based upon the inversion of body-wave, surface-wave and 
free-oscillation data, is the current standard model of the 
Earth's internal structure. It predicts a central density of 
13,090 kgm -3 . 


3.7.5.1 The crust 

The Earth’s crust corresponds to Bullen’s region A. The 
structures of the crust and upper mantle are complex and 
show strong lateral variations. This prohibits using the 
inversion of body-wave travel-times to get a vertical dis¬ 
tribution of seismic velocities. The most reliable informa¬ 
tion on crustal seismic structure conies from seismic 
refraction profiles and deep crustal reflection sounding. 
The variation of seismic velocity with depth in the crust 
differs according to where the profiles are carried out. 
Ancient continental shield domains have different verti¬ 
cal velocity profiles than younger continental or oceanic 
domains. In view of this variability any generalized 
petrological model of crustal structure is bound to be an 
oversimplification. However, with this reservation, it is 
still possible to summarize some general features of 
crustal structure, and the corresponding petrological 
layering. 

A generalized model of the structure of oceanic crust is 
shown in Fig. 3.85. Oceanic crust is only 5-10 km thick. 
Under a mean water depth of about 4.5 km the top part of 
the oceanic crust consists of a layer of sediments that 
increases in thickness away from the oceanic ridges. The 
igneous oceanic basement consists of a thin (—0.5 km) 
upper layer of superposed basaltic lava flows underlain by 
a complex of basaltic intrusions, the sheeted dike complex. 
Below this the oceanic crust consists of gabbroic rocks. 
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Fig. 3.85 Generalized petrological model and P-wave velocity-depth 
profile for oceanic crust. 


The vertical structure of continental crust is more 
complicated than that of oceanic crust, and the structure 
under ancient shield areas differs from that under 
younger basins. It is more difficult to generalize a repre¬ 
sentative model (Fig. 3.86). The most striking difference is 
that the continental crust is much thicker than oceanic 
crust. Under stable continental areas the crust is 35-40 
km thick and under young mountain ranges it is often 
50-60 km thick. The continental Moho is not always a 
sharp boundary. In some places the transition from crust 
to mantle may be gradual, with a layered structure. 
Originally, the Conrad seismic discontinuity was believed 
to separate an upper crustal (granitic) layer from a lower 
crustal (basaltic) layer. However, the Conrad discontinu¬ 
ity is not found in all places and there is some doubt as to 
its real nature; it may represent a more complicated com¬ 
positional or phase boundary. Crustal velocity studies 
have defined two anomalous zones that often disrupt the 
otherwise progressive increase of velocity with depth. A 
low-velocity layer within the middle crust is thought to be 
due to intruded granitic laccoliths; it is called the sialic 
low-velocity layer. It is underlain by a middle crustal layer 
composed of migmatites. Below this layer the velocity 
rises sharply, forming a “tooth” in the velocity profile. 
This tooth and the layer beneath it often make up a thinly 
layered lower crust. Refractions and reflections at the 
top of the tooth are thought to explain the Conrad 
discontinuity. 


3.7.52 The upper mantle 

In his 1942 model of the Earth’s interior (see Table 3.4) 
Bullen made a distinction between the upper mantle 
(layers B and C) and the lower mantle (layer D). The 
upper mantle is characterized by several discontinuities of 
body-wave velocities and steep velocity gradients (Fig. 
3.87). The top of the mantle is defined by the Mohorovifig 
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Fig. 3.86 Generalized petrological model and P-wave velocity-depth 
profile for continental crust (after Mueller, 1977). 


discontinuity (Moho), below which the P-wave velocity 
exceeds 7.6 km s -1 . The Moho depth is very variable, with 
a global mean value around 3(M0 km. A weighted 
average of oceanic and continental structures equal to 
24.4 km is used in model PREM. The assumption of a 
spherically symmetric Earth does not hold well for the 
crust and upper mantle. Lateral differences in structure 
are important down to depths of at least 400 km. The 
uppermost mantle between the Moho and a depth of 
80-120 km is rigid, with increasing P- and S-wave veloci¬ 
ties. This layer is sometimes called the lid of the underlying 
low-velocity layer. Together with the crust, the lid forms 
the lithosphere, the rigid outer shell of the Earth that takes 
part in plate tectonic processes (see Section 1.2). The 
lithosphere is subdivided laterally into tectonic plates that 
may be as large as 10,000 km across (e.g., the Pacific plate) 
or as small as a few thousand kilometers (e.g., the 
Philippines plate). The plates are very thin in comparison 
to their horizontal extent. 

An abrupt increase of P- and S-wave velocities by 
3-4% has been observed at around 220 ± 30 km depth; it 
is called the Lehmann discontinuity. Like the Conrad dis¬ 
continuity in the crust it is not found everywhere and its 
true meaning is in question. Between the lid and the 
Lehmann discontinuity, in the depth range 100-200 km, 
body-wave velocity gradients are weakly negative, i.e., the 
velocities decrease with increasing depth. This layer is 
called the low-velocity layer (LVL). Its nature cannot be 
evaluated from body waves, because they do not bottom 
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Fig. 3.87 (a) P-wave 
velocity-depth profile in the 
upper mantle beneath the 
Canadian shield, and (b) ray 
paths through the model; 
note the sharp bending of 
rays at the velocity 
discontinuities at depths of 
400 km and 670 km (after 
LeFevre and Helmberger, 
1989). 



in the layer and its lower boundary is not sharp. The 
evidence for understanding the LVL comes from the 
inversion of surface-wave data. Only long-period surface 
waves with periods longer than about 200 s can penetrate 
to the depths of the base of the LVL. The data from 
surface waves are not precise. Their depth resolution is 
poor and only the S-wave velocity can be determined. 
Thus, the top and bottom of the LVL are not sharply 
defined. 

The LVL is usually associated with the asthenosphere, 
which also plays an important role in plate tectonic theory. 
The decreases in seismic velocities are attributed to 
reduced rigidity in this layer. Over geological time inter¬ 
vals the mantle reacts like a viscous medium, with a vis¬ 
cosity that depends on temperature and composition. 
From the point of view of plate tectonics the asthenos¬ 
phere is a viscous layer that decouples the lithosphere 
from the deeper mantle; by allowing slow convection, it 
permits or promotes the relative motions of the global 
plates. The LVL and asthenosphere reflect changes of rhe¬ 
ological properties of the upper mantle material. Brittle 
behavior in the crust and lithosphere gives way with 
increasing depth to ductile behavior in the low-rigidity 
asthenosphere. The brittle-ductile transition is gradual 
and depends on properties such as the rock composition, 
geothermal gradient, initial crustal thickness and strain 
rate. It probably occurs differently in oceanic and conti¬ 
nental lithosphere, as suggested in Fig. 2.69. 

The composition of the upper mantle is generally 
taken to be peridotitic, with olivine [(Mg,Fe) 2 Si0 4 ] as the 
dominant mineral. With increasing depth the hydrostatic 
pressure increases and eventually causes high-pressure 
transformation of the silicate minerals. This is reflected in 
the seismic properties. Travel-time-A curves of body 
waves show a distinct change in slope at epicentral dis¬ 
tances of about 20°. This is attributed to a discontinuity 
in mantle velocities at a depth of around 400 km (Fig. 
3.87). The 400 km discontinuity (or 20° discontinuity) is 
interpreted as due to a petrological change from an 
olivine-type lattice to a more closely packed spinel-type 
lattice. 


The lid, low-velocity layer and the zone down to the 
400 km discontinuity together correspond to layer B in 
Bullens Earth Model A. A further seismic discontinuity 
occurs at a depth of 650-670 km. This is a major feature 
of mantle structure that has been observed world-wide. In 
the transition zone between the 400 km and 670 km dis¬ 
continuities there is a further change in structure from 13- 
spinel to y-spinel, but this is not accompanied by 
appreciable changes in physical properties. At 670 km the 
spinel transforms to perovskite. This transition zone cor¬ 
responds to the upper part of Bullen’s layer C. 

3.7.53 The lower mantle 

The lower mantle is now classified as the part below the 
important seismic discontinuity at 670 km. Its composi¬ 
tion is rather poorly known, but it is thought to consist of 
oxides of iron and magnesium as well as iron-magnesium 
silicates with a perovskite structure. The uppermost part 
of the lower mantle between 670 and 770 km depth has a 
high positive velocity gradient and corresponds to the 
lower part of Bullen’s layer C. Beneath it lies Bullen’s 
layer D', which represents a great thickness of normal 
mantle, characterized by smooth velocity gradients and 
the absence of seismic discontinuities. 

Just above the core-mantle boundary an anomalous 
layer, approximately 150-200 km thick, has been identi¬ 
fied in which body-wave velocity gradients are very small 
and may even be negative. Although part of the lower 
mantle, it evidently serves as a boundary layer between 
the mantle and core. It is labelled D" to distinguish it from 
the normal mantle above it. The structure and role of the 
D" layer are not yet known with confidence, but it is the 
focus of intensive current research. Models of the inter¬ 
nal structure of D" have been proposed with positive 
velocity gradients, others with negative velocity gradients, 
and some with small velocity discontinuities. The latter 
possibility is important because it would imply some 
stratification within D". 

The most interesting aspect of D" is the presence - 
revealed by seismic tomographic imaging (Section 3.7.6.2) 
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- of velocity variations of several percent that take place 
over lateral distances comparable in size to the continents 
and oceans in Earth’s crust. The term “grand structure” 
has been coined for these regions; the thicker parts have 
also been termed “crypto-continents” and the thinner 
parts “crypto-oceans” (see Fig. 4.38). Moreover, the seis- 
mically fast regions (in which temperatures are presumed 
to be cooler than normal) lie beneath present subduction 
zones. This suggests that cold subducted lithosphere may 
eventually sink to the bottom of the mantle where it is 
colder and more rigid than the surrounding mantle and 
hence has higher body-wave velocities. A large low-veloc¬ 
ity (hot) region underlies the Pacific basin, in which many 
centers of volcanism and locally high heat flow 
(“hotspots,” see Section 1.2.8) are located. The D" layer is 
suspected of being the source of the mantle plumes that 
cause these anomalies. Exceptionally hot material from D" 
rises in thin pipe-like mantle plumes to the 670 km discon¬ 
tinuity, which opposes further upward motion in the same 
way that it resists the deeper subduction of cold lithos¬ 
pheric slabs. Occasionally a hot plume is able to break 
through the 670 km barrier, producing a surface hotspot. 

If our current understanding of the D" layer is correct, 
it plays an important role in geodynamic and geothermal 
behavior. On the one hand, D” serves as the source of 
material for the mantle plumes that give rise to hotspots, 
which are important in plate tectonics. On the other hand, 
the thermal properties of D" could influence the outward 
transport of heat from the Earth’s core; in turn, this could 
affect the intricate processes that generate the Earth’s 
magnetic field. 

3.7.54 The core 

Early in Earth’s history, dense metallic elements are 
thought to have settled towards the Earth’s center, 
forming the core, while lighter silicates ascended and 
solidified to form the mantle. Studies of the compositions 
of meteorites and of the behavior of metals at high pres¬ 
sure and temperature give a plausible picture of the com¬ 
position and formation of the core. It consists mainly of 
iron, with perhaps up to 10% nickel. The observed pres¬ 
sure-density relationships suggest that some less-dense 
non-metallic elements (Si, S, O) may be present in the 
outer core. It is not known whether small amounts of the 
more common radioactive elements ( 40 K, 232 Th, 235 U and 
238 U) are present in large enough abundances to con¬ 
tribute to the heat supply of the core. 

The core has a radius of 3480 km and consists of a 
solid inner core (layer G in Bullen’s Earth Model A) sur¬ 
rounded by a liquid outer core (Bullen layer E) that is 
1220 km thick. The transitional layer (F) was modelled 
by Bullen as a zone in which the P-wave velocity gradi¬ 
ent is negative (i.e., a decreases with increasing depth). 
Not all seismologists agreed with this interpretation of 
the t -A curves and so the nature of layer F remained 
controversial for many years. The need for a layer F has 


now been discarded. Improved seismographic resolution 
has yielded a large quantity of high-quality data for 
reflections from the boundaries of the outer core (PcP, 
ScS) and the inner core (PKiKP) which have helped 
clear up the nature of these boundaries. The PKiKP 
phase contains high frequencies; this implies that the 
inner core boundary is sharp, probably no more than 5 
km thick. The seismic events earlier interpreted as due to 
a layer F are now regarded as rays that have been scat¬ 
tered by small-scale features at the bottom of the 
mantle. 

The inner core transmits P-waves (PKIKP phase) but 
S-waves in the inner core (PKJKP phase), although in 
principle possible, have not yet been observed unequivo¬ 
cally. Body-wave travel-times do not constrain the rigidity 
of the inner core. The amplitude spectrum of the frequen¬ 
cies of higher modes of the Earth’s free oscillations show 
that the inner core is likely solid. Flowever, it is possible 
that it is not completely solid. Rather, it may be a mixture 
of solid and liquid phases at a temperature close to the 
solidification temperature. An analogy can be made with 
the mushy, semi-frozen state that water passes through on 
freezing to ice. 

The outer core is fluid, with a viscosity similar to that 
of water. It is assumed to be homogeneous and its 
thermal state is supposed to be adiabatic. These are the 
conditions to be expected in a fluid that has been well 
mixed, in the Earth’s case by convection and differential 
rotation. One theory of core dynamics holds that the 
iron-rich inner core is solidifying from the fluid outer 
core, leaving behind its lighter elements. These constitute 
a less-dense, therefore gravitationally buoyant, fluid, 
which rises through the denser overlying liquid. This com¬ 
positional type of buoyancy could be an important con¬ 
tributor to convection in the outer core, and therefore to 
the dynamics of the core and generation of the Earth’s 
magnetic field. 

The core-mantle boundary (CMB) is also called the 
Gutenberg discontinuity. It is characterized by very large 
changes in body-wave velocities and is the most sharply 
defined seismic discontinuity. Seismic data show that the 
boundary is not smooth but has a topography of hills 
and valleys. Anomalies in the travel-times of PKKP 
phases - which are reflected once internally at the CMB - 
have been attributed to scattering by topographic fea¬ 
tures with a relief of a few hundred meters. However, 
depending on conditions in the hot D" layer of the lower 
mantle immediately above the CMB some topographic 
features may be up to 10 km high. Interference between 
the CMB topography and fluid motions in the outermost 
core may couple the core and mantle to each other 
dynamically. 

3.7.6 Seismic tomography 

A free translation of the term tomography is “represen¬ 
tation in cross-section.” Neighboring two-dimensional 
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cross-sections can be combined to give a three- 
dimensional model. The use of computer-aided tomog¬ 
raphy (CAT) in medical diagnosis is well known as a 
non-invasive method of examining internal organs for 
abnormal regions. X-rays or ultrasonic rays are 
absorbed unequally by different materials. CAT consists 
of studying the attenuation of x-rays or ultrasonic waves 
that pass through the body in distinctly controlled 
planar sections. Seismic tomography uses the same prin¬ 
ciples, with the difference that the travel-times of the 
signals, as well as their attenuation, are observed. Hence 
the technique may be described as the three-dimensional 
modelling of the velocity or attenuation distribution 
of seismic waves in the Earth. The technique requires 
powerful computational facilities and sophisticated 
programming. 

The travel-time of a seismic wave from an earthquake 
focus to a seismograph is determined by the velocity dis¬ 
tribution along its path. For an idealized, spherically 
symmetric Earth model the radial distributions of veloc¬ 
ity are known. The velocities in the model are mean 
values, which average out lateral fluctuations. If such a 
velocity model were used to compute travel-times of 
different phases to any epicentral distance, a set of 
curves indistinguishable from Fig. 3.77 would result. In 
reality, the observed travel-times usually show small 
deviations from the calculated times. These discrepan¬ 
cies are called travel-time residuals or anomalies , and 
they can have several causes. An obvious cause is that the 
focal depth of an earthquake is not zero, as assumed in 
Fig. 3.77, but may be up to several hundred kilometers. 
The parametrized Earth model iasp91 takes this into 
account and tabulates travel-times for several focal 
depths. Clearly, precise determination of earthquake 
focal parameters (epicentral location, depth and time of 
occurrence) are essential prerequisites for seismic 
tomography. An important factor is the assumption of 
spherical symmetry, which is not perfectly valid, and so 
the ellipticity of the Earth's figure must be taken into 
account. 

A local source of travel-time residuals is the particular 
velocity-depth distribution under an observational 
network. Ideally, the local vertical velocity profile should 
be known, so as to allow compensation of an observed 
travel-time for local anomalous structure. In practice, the 
signals from a selected earthquake are averaged for 
several stations in a given area of the surface (e.g., about 
3° square) to reduce local perturbations. 

The lateral variations in P- and S-wave velocity at any 
given depth may amount to a few percent of the 
“average” velocity for that depth assumed in the reference 
model. If, at a certain depth, a seismic ray passes through 
a region in which the velocity is slightly faster than 
average, the wave will arrive slightly sooner than expected 
at the receiver; if the anomalous velocity is slower than 
average, the wave will arrive late. This permits classifica¬ 
tion of travel-time as “early” or “late” depending on 


whether a ray has traversed a region that is “fast” or 
“slow” with respect to the assumed model. 

The velocity of a seismic wave is determined by 
elastic parameters and density, which are affected by 
temperature. Thus, the velocity anomalies obtained 
from seismic tomography on a global scale are generally 
interpreted in terms of abnormal temperature and rigid¬ 
ity. A “slow” region is commonly associated with above- 
average temperature and lower rigidity, while a “fast” 
region is attributed to lower temperature and higher 
rigidity. 

3.7.6.1 Travel-time residuals and velocity anomalies 

The velocity distribution in the Earth is more heteroge¬ 
neous than in a standard velocity-model. The time (t) 
taken for a seismic ray to travel along the path from 
an earthquake to a recording station thus differs from 
the value ( t 0 ) predicted by the model. The difference 
(At = t 0 - t) is a travel-time anomaly. A travel-time residual 
(A t/t 0 ) for the path is obtained by expressing the anomaly 
as a percentage of the expected travel-time (Box 3.4). If a 
given path is subdivided into segments with different 
velocities, the travel-time is equivalent to the sum of the 
travel-times through the individual segments. This is like 
an equation with one known value (A t/t 0 ) and several 
unknown terms (the velocity anomalies in each segment). 
For a given earthquake several seismic stations may 
record rays that traverse the region of interest. Each 
station may also have recorded rays that crossed the 
region from different earthquakes. A large set of travel¬ 
time residuals results, equivalent to a large number of 
equations. A mathematical procedure called matrix inver¬ 
sion is used to solve the set of equations and obtain the 
velocity anomalies. The sophisticated analysis involves 
intensive data processing, and is beyond the scope of this 
text. However, Box 3.5 illustrates for a simple pattern of 
travel-time residuals (Box 3.4, Fig. B3.4) how a distribu¬ 
tion of velocity anomalies may be deduced by back pro¬ 
jection. In this method successive adjustments are made 
to an initial velocity distribution to account for the 
observed travel-time residuals. 

Tomographic imaging of the Earth’s three-dimen¬ 
sional velocity distribution is able to resolve small 
differences in seismic velocities. In some studies the 
signals are of local origin, generated by earthquakes or 
explosions within or near the volume of interest. Other 
studies are based on teleseismic signals, which originated 
in earthquakes at more than 20° epicentral distance. The 
analysis requires precise location of the earthquake 
source, corrections for local effects (such as crustal struc¬ 
ture close to each measurement station), and reconstruc¬ 
tion of the seismic rays between source and seismometer. 
Seismic tomography can be based on either body waves 
or surface waves. The database for P-wave studies con¬ 
sists of hundreds of thousands of first arrival times from 
earthquakes that occurred in the past 30 years, which 
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Box 3.4: Calculation of travel-time residuals 


Consider the passage of seismic rays in six directions 
through a square region containing four equal areas, 
each a square with edge 5 km, in which the P-wave 
velocities ( V ) are 4.9 km s _1 , 5.0 km s -1 , 5.1 km s _1 
and 5.2 km s -1 (Fig. B3.4). Let the expected velocity 
( V Q ) throughout the region be 5.0 km s -1 . The velocity 
difference (A V) in each area is found by subtracting 
the reference value, and the velocity anomaly (A V/V Q ) 
is obtained by expressing the difference as a percent¬ 
age of the expected value. This gives zones that are 2% 
and 4% fast, a zone that is 2% slow and a zone with no 
anomaly. 

Suppose that six seismic rays traverse the square 
region as in Fig. B3.4. If the velocity were 5.0 km s _1 
in each area, the expected travel-time (? 0 ) would be 
2.0 s for each of the horizontal and vertical rays and 
2 a/ 2 = 2.828 s for the longer diagonal rays. Flowever, 
the real velocity is different in each area. As a result, 
some of the observed travel-times are shorter and 
some are longer than the expected value. A travel-time 
anomaly (At) is computed by subtracting the observed 
travel-times (t) from the expected value (Fig. B3.4d). 
The travel-time residual (A tlt 0 ) is obtained by express¬ 
ing the anomaly as a percentage of the expected travel- 
time (Fig. B3.4e). Note that the residuals are not 
simple averages of the velocity anomalies along each 
path. 
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Fig. B3.4 Computation of relative travel-time residuals (in 
percent) for a simple four-block structure (modified after 
Kissling, 1993). 


have been catalogued by the International Seismological 
Center. In addition, S-wave models have been derived for 
data sets from digital broadband seismic stations. The 
velocity structures of both P-waves and S-waves have 
been obtained for global models of the entire mantle as 
well as for regional studies (e.g., individual subduction 
zones). 

3.7.62 Mantle tomography 

The inversion of body-wave data provides evidence for 
the lateral variations in velocity in the deeper interior. The 
lateral variations at a given depth are equivalent to the 
variations on the surface of a sphere at that depth, and so 
can be depicted with the aid of spherical harmonic func¬ 
tions. The contoured velocity anomalies obtained by 
spherical harmonic analysis for a depth of 2500 km in the 
lower mantle are dominated by a ring of fast P-wave 
velocities around the Pacific and a slow-velocity region in 
the center (Fig. 3.88). Slow velocities are also present at 
depth under Africa. The pattern is present at nearly all 
depths in the lower mantle (i.e., below 1000 km). This is 
shown clearly by a vertical cross-section along a profile 
around the equator (Fig. 3.89, bottom frame). The slow 
velocities are interpreted as the expression of warm 
mantle material that may be rising from the core-mantle 
boundary as so-called “super-plumes.” The role of these 
super-plumes in the circulation pattern of mantle convec¬ 
tion is not yet understood. Deep fast-velocity (“cold”) 
regions under America and Indonesia and slow-velocity 
(“warm”) regions under the Pacific and Atlantic oceans 
extend from about 1000 km depth to the core-mantle 
boundary. The lower mantle velocity anomalies show 
little correlation with plate tectonic elements, which are 
features of the lithosphere. 

Velocity anomalies in the upper mantle, on the other 
hand, are clearly related to plate tectonic features. The 
upper mantle velocity structures can be modelled by 
inverting teleseismic body waves, but these depths can 
also be probed by long-period surface waves, which are 
more sensitive to variations in rigidity (and thus tempera¬ 
ture). The inversion of long-period surface-wave data is 
an effective technique for modelling upper mantle S-wave 
velocities, especially under oceanic areas, where the reso¬ 
lution of body-wave anomalies is poor. The pattern of S- 
wave velocity anomalies in the upper mantle along the 
equatorial cross-section (Fig. 3.89, middle frame) shows 
generally elevated velocities under the “cold” continents 
and reduced velocities under the “warm” oceanic ridge 
systems. In these analyses the large local variations in S- 
wave velocities in the heterogeneous crust constitute an 
impediment to greater resolution. 

Important advances in understanding the geodynami- 
cal state of the Earth’s mantle have been made as a result 
of numerous regional seismic tomography studies, which 
have focused on specific tectonic environments such 
as zones of tectonic plate convergence. In particular, 
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Box 3.5: Calculation of velocity anomalies 


To illustrate how a velocity structure can be deduced 
from observed travel-time residuals we take as starting 
point the set of travel-time residuals (A t/t 0 ) computed 
for the velocity distribution in Box 3.4. The expected 
travel-time along a seismic path of length L is t 0 = L/V 0 , 
where V 0 is the reference velocity. If the true velocity is 
V= V 0 + A V, the observed travel-time is 
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Fig. B3.5 Backward projection of relative travel-time residuals to 
obtain velocity anomalies (modified after Kissling, 1993). 


Next, we assume that the percentage velocity anom¬ 
alies (A VIV Q ) are the same as the percentage travel-time 
residuals. As shown by Eq. (2), this is not strictly true; 
the discrepancy is of the order of (A V/V 0 ). However, the 
velocity anomalies are usually very small (see Figs. 
3.88-3.90), so assuming that the velocity anomaly is 
equal to the travel-time residual is a reasonable approxi¬ 
mation. 

Consider a horizontal ray that traverses the upper 
two “fast” blocks in Fig. B3.5. To account for the (early) 
travel-time anomaly of + 2.9% let each upper block be 
allocated a (fast) velocity anomaly of + 2.9%. Similarly, 
let each of the bottom two blocks be allocated velocity- 
anomalies of -1.02%. This simple velocity distribution 
(Fig. B3.5a) satisfies the travel-time anomalies for the 
horizontal rays. However, in the vertical direction it 
gives travel-time anomalies of + 0.94% (the mean of 


+ 2.90% and -1.02%) for each ray (Fig. B3.5b). This 
does not agree with the observed anomalies for the two 
vertical rays (+ 0.98% and + 0.90%, respectively); one 
vertical anomaly is 0.04% too large, the other is 0.04% 
too small. The velocities in the blocks are adjusted 
accordingly by making a correction of + 0.04% to the 
left-hand blocks and -0.04% to the right-hand blocks 
(Fig. B3.5c). This gives a new distribution of velocity 
anomalies which satisfies the horizontal and vertical 
rays (Fig. B3.5d). 

The model now gives travel-time anomalies along 
each of the diagonal rays of + 0.94%, compared to 
observed anomalies of + 1.92% and -0.04%, respec¬ 
tively (Fig. B3.5e). Further corrections of + 0.98% are 
now made to the upper right and lower left blocks, and 
-0.98% to the lower right and upper left blocks (Fig. 
B3.5f). The resulting distribution of velocity anomalies 
(Fig. B3.5g) satisfies all six rays through the anomalous 
region, and is close to the original distribution of veloc¬ 
ity anomalies (Fig. B3.4b). 


investigations of subduction zones have provided new 
insights into the effects on mantle convection of the 
seismic discontinuities at 400 km and 670 km depth. As 
discussed in more depth in Section 4.2.9.3, two models of 
mantle convection are prevalent. They differ mainly in 
the roles played by these seismic discontinuities. In the 
first - whole mantle convection - the entire mantle par¬ 
ticipates in the convection process. In the second - 
layered convection - the seismic discontinuities bound a 
transition zone that separates a system of convection 
cells in the upper mantle from a system of convection 
cells in the lower mantle. 

However, the reality is more complex, and neither 
model satisfies all the observational constraints. At a sub¬ 
duction zone, the subducting plate is colder and denser 
than the overriding plate and sinks into the mantle (Fig. 
3.90). Evidence from regional seismic tomography shows 
that, at some subduction zones (e.g., the Aegean arc, Fig. 


3.90a) the subducting cold slab is able to penetrate 
through the transition zone and to sink deeper into 
the lower mantle. In other cases (e.g., the Kurile arc, 
Fig. 3.90b) the 670 km discontinuity appears to block the 
downward motion of the subducting slab, which is then 
caused to deflect horizontally. Some slabs that penetrate 
the 670 km discontinuity appear to sink deep into the 
lower mantle. Some may in fact reach the core-mantle 
boundary. 

Seismic tomography is a potent technique for describ¬ 
ing conditions in the mantle. Many questions, such as the 
nature of the “super-plume” structures beneath the 
Pacific and Africa, remain unanswered. Despite convec¬ 
tive mixing there may be more heterogeneity in mantle 
composition than is often supposed. Indeed, the assump¬ 
tion that fast and slow seismic velocities imply cold 
and hot temperatures, respectively, may be overly simplis¬ 
tic. Yet, seismic tomography is the best tool currently 
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Fig. 3.88 Map of P-wave 
velocity anomalies in the 
lower mantle (2500 km 
depth). The deviations are 
plotted as percent faster or 
slower than the reference 
velocity at this depth (after 
Dziewonski, 1984, 1989). 



Fig. 3.89 Seismic 
tomographic section through 
the mantle along an 
equatorial profile. Middle 
frame'. S-wave anomalies in 
the upper mantle to a depth 
of 670 km. Bottom frame: P- 
wave anomalies in the lower 
mantle in depths between 
670 and 2890 km (after 
Woodhouse and Dziewonski, 
1984). 
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available for investigating geodynamic processes in the 
Earth’s interior. 
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Fig. 3.90 Seismic 
tomographic sections 
showing different styles of 
subduction. (a) At the Aegean 
arc the northward subducting 
African plate sinks through 
the 410 km and 660 m 
seismic discontinuities, while 
(b) at the southern Kurile arc 
the westward subducting 
Pacific plate appears to be 
unable to penetrate into the 
lower mantle and deflects 
horizontally (after Karason 
and Van der Hilst, 2000). 
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3.9 REVIEW QUESTIONS 

1. Describe the principle of the seismometer. 

2. Describe the particle motions relative to the direction 
of propagation of the two seismic body waves and the 
two seismic surface waves. 

3. What are the Lame constants in seismic theory? 

4. What is meant by dispersion with regard to the propa¬ 
gation of surface waves? 

5. Sketch how the train of Rayleigh waves from a 
distant earthquake would appear on a seismogram. 

6. What are the free oscillations of the Earth? What 
kinds are possible? What is meant by the normal 
modes ? What are higher modes ? How do they relate to 
surface waves? 

7. How does the elastic rebound model explain the 
origin of a tectonic earthquake? 

8. What is the epicenter of an earthquake? Explain 
how the epicenter of an earthquake can be located? 
What is the minimum number of seismic records 
needed? 

9. Explain the difference between earthquake intensity 
and magnitude. 

10. If the magnitude of an earthquake is 0.5 greater than 
that of another, how much greater is the amount of 
energy it releases? 

11. How does a tsunami originate? Why are tsunami 
barely noticeable over the open ocean but very dan¬ 
gerous near shore? 

12. Describe the geographical distribution of the Earth’s 
seismically active zones. 

13. Describe with the aid of sketches the distribution of 
earthquakes at the three major plate boundaries: 
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(a) spreading ridge, (b) transform fault, and (c) sub- 
duction zone. 

14. Sketch the fault-plane solutions that characterize 
earthquakes at each type of plate boundary. Describe 
their significance with regard to plate tectonic 
motions. 

15. How do the distribution and focal solutions of earth¬ 
quakes at a transform fault differ from those at a 
strike-slip fault? 

16. What is the Mohoroviqi? discontinuity? What is the 
seismic evidence for this feature? What is its average 
depth under continents, under oceans, and for the 
entire world? 

17. How do seismic wave velocities change at the major 
discontinuities in the Earth’s internal structure? How 
are these discontinuities characterized? 

18. What is the brittle-ductile transition in the Earth? 
What physical properties determine the depth and 
nature of this transition? 

19. On a cross-section of the Earth, sketch the paths of 
the following seismic rays: (i) PKP, (ii) SKS, (iii) PcP, 
(iv) PPP. 

20. What is a PKIKP wave? Describe the refraction of 
this wave at each discontinuity it crosses. 

21. What is the critical distance in seismic refraction sur¬ 
veying? What is the crossover distance? What is a head 
wave ? What are supercritical reflections? 

22. A seismic survey is conducted over level ground con¬ 
sisting of two horizontal layers. Sketch a travel-time 
diagram that shows the arrivals of the direct wave, 
the reflected wave, and the doubly refracted wave. 
How can the seismic velocity of each layer be deter¬ 
mined from the diagram? 

23. What is normal move-out in seismic reflection pro¬ 
filing? 

24. What is meant by the migration of reflection seismic 
records? Why is it necessary? 

25. Describe the split-spread and common-mid-point 
methods in reflection profiling? What is achieved by 
each of these methods? 


3.10 EXERCISES 

1. Calculate the bulk modulus (K), the shear modulus 
(/x) and Poisson’s ratio (u) for the lower crust, upper 
mantle and lower mantle, respectively, using Eqs. 
(3.153) and (3.156) and the values for the P-wave (a) 
and S-wave (/3) velocities, and density (p) given in the 
following table. 


Region 

Depth 

[km] 

a 

[km s- 1 ] 

P 

[kms '] 

[kgm 3 ] 

Lower crust 

33 

7.4 

4.8 

3100 

Upper mantle 

400 

8.5 

12.2 

3900 

Lower mantle 

2200 

12.2 

7.0 

5300 


2. The table below gives the densities and seismic P- 
and S-wave velocities at various depths in the 
Earth. 


Depth 

[km] 

Density 
[kg m 3 ] 

P-wave 
[km s- 1 ] 

S-wave 
[km s- 1 ] 

100 

3380 

8.05 

4.45 

500 

3850 

9.65 

5.22 

1000 

4580 

11.46 

6.38 

2000 

5120 

12.82 

6.92 

2890 

5560 

13.72 

7.27 

2900 

9900 

8.07 

0 

4000 

11320 

9.51 

0 

5000 

12120 

10.30 

0 

5500 

12920 

11.14 

3.58 

6470 

13090 

11.26 

3.67 


(a) From these quantities calculate the rigidity 
modulus, p,, bulk modulus, K, and Poisson's 
ratio, v, at each depth. 

(b) Discuss in your own words the information that 
these data give about the deep interior of the 
Earth. 

3. A strong earthquake off the coast of Japan sets off a 
tsunami that propagates across the Pacific Ocean 
(average depth <7=5 km). 

(a) Calculate the velocity of the wave in km hr -1 and 
the corresponding wavelength, when the wave 
has a dominant period of 30 min. 

(b) How long does the wave take to reach Hawaii, 
which is at an angular distance of 54° from the 
epicenter? 

4. The dispersion relation between frequency co and 
wave number k of seismic water waves for water 
depth <7is (Box 3.3) 

a> 2 = gktanh(kd) 

(a) Modify this expression for wavelengths that are 
much shorter than the water depth. 

(b) Determine the phase velocity of these waves. 

(c) Show that the group velocity of the waves is half 
the phase velocity. 

5. In a two-layer Earth the mantle and core are each 
homogeneous and the radius of the core is one-half 
the radius of the Earth. Derive a formula for the 
travel-time curve for the arrival time t of the 
phase PcP at epicentral distance A. Verify the 
formula for the maximum possible value of A in this 
model. 

6. Why might one expect an interface with a small 
critical angle to be a good reflector of seismic energy? 

7. The P-wave from an earthquake arrives at a seismo¬ 
graph station at 10:20 a.m. and the S-wave arrives at 
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10:25 a.m. Assuming that the P-wave velocity is 5 
km s -1 and that Poisson's ratio is 0.25, compute the 
time at which the earthquake occurred and its epi- 
central distance in degrees from the seismograph 
station. 

8. The following table gives arrival times of P-waves (r p ) 
and S-waves (t s ) from a nearby earthquake: 


Recording 

Station 

Time of day 
[hr:min] 

‘pM 


A 

23:36 

54.65 

57.90 

B 

23:36 

57.34 

62.15 

C 

23:37 

00.49 

07.55 

D 

23:37 

01.80 

10.00 

E 

23:37 

01.90 

10.10 

F 

23:37 

02.25 

10.70 

G 

23:37 

03.10 

12.00 

H 

23:37 

03.50 

12.80 

I 

23:37 

06.08 

18.30 

J 

23:37 

07.07 

19.79 

K 

23:37 

08.32 

21.40 

L 

23:37 

11.12 

26.40 

M 

23:37 

11.50 

26.20 

N 

23:37 

17.80 

37.70 


(a) Plot the arrival-time differences (t - tj against 

h P 

the arrival times of the P-wave to produce a 
Wadati diagram. 

(b) Determine the ratio a//3 of the seismic velocities. 

(c) Determine the time of occurrence (? 0 ) of the 
earthquake. 

9. A plane seismic wave, travelling vertically downwards 
in a rock of density 2200 kg m -3 with seismic velocity 
2000 m s -1 , is incident on the horizontal top surface 
of a rock layer of density 2400 kg m -3 and seismic 
velocity 3300 m s -1 . 

(a) What are the amplitude ratios of the transmitted 
and reflected waves? 

(b) What fraction of the energy of the incident wave 
is transmitted into the lower medium? 

10. A plane seismic wave travels vertically downwards at a 
velocity of 4800 m s _1 through a salt layer with density 
2100 kg m -3 . The wave is incident upon the top 
surface of a sandstone layer with density 2400 kg m -3 . 
The phase of the reflected wave is changed by 180° and 
the reflected amplitude is 2% of the incident ampli¬ 
tude. What is the seismic velocity of the sandstone? 

11. (a) Calculate the minimum arrival times for seismic 
reflections from each of the reflecting interfaces in 
the following section. Consider the base of the lower¬ 
most bed to be a reflector as well. 

(b) What is the average velocity of the section for a 
reflection from the base of the dolomite? 

(c) Using the listed densities calculate the reflection 
coefficient for each interface (except the base of 
the dolomite). Which interface gives the strongest 


Formation 

Density 

[kgnT 3 ] 

Thickness 

[m] 

Formation velocity 
[ms -1 ] 

Alluvium 

1500 

150 

600 

Shale 

2400 

450 

2700 

Sandstone 

2300 

600 

3000 

Limestone 

2500 

900 

5400 

Salt 

2200 

300 

4500 

Dolomite 

2700 

600 

6000 


reflection and which the weakest? At which inter¬ 
faces does a change in phase occur? What does 
this mean? 

12. A reflection seismic record in an area of relatively flat 
dips gave the following data: 


Distance shot-point 
to detector [m] 

Travel-time (tj), 

1 st reflection [s] 

Travel-time (t 2 ), 
2nd reflection [s] 

30 

1.000 

1.200 

90 

1.002 

1.201 

150 

1.003 

1.201 

210 

1.007 

1.202 

270 

1.011 

1.203 

330 

1.017 

1.205 

390 

1.023 

1.207 


(a) Plot the t-x curves for these reflections to show 
the “moveout” effect. 

(b) On a different graph, plot the t 2 -x 2 curves (i.e., 
squared data) for the reflections. 

(c) Determine the average vertical velocity from the 
surface to each reflecting bed. 

(d) Use these velocities to compute the depths to the 
reflecting beds. 

13. The following table gives two-way travel-times of 
seismic waves reflected from different reflecting inter¬ 
faces in a horizontally layered medium. 


Geophone to 
shot-point 
distance [m] 


Travel-time [s] to 

First 

reflector 

Second 

reflector 

Third 

reflector 

500 

0.299 

0.364 

0.592 

1000 

0.566 

0.517 

0.638 

1500 

0.841 

0.701 

0.708 

2000 

1.117 

0.897 

0.799 

2500 

1.393 

1.099 

0.896 


(a) Draw a plot of (travel-time) 2 against (distance) 2 . 

(b) Determine the vertical two-way travel-time 
(‘'echo-time”) and average velocity to each 
reflecting interface. 

(c) Compute the depth of each reflector and the 
thickness of each layer. 

(d) Compute the true velocity (interval velocity) of 
each layer. 
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(e) Verify your results by computing the total verti¬ 
cal travel-time for a wave reflected from the 
deepest interface. 

14. Assume the horizontally layered structure from the 
previous problem. 

(a) If a seismic ray leaves the surface at an angle of 
15° to the vertical, how long does it take to return 
to the surface after reflecting from the basement? 

(b) At what horizontal distance from the shot-point 
does this ray reach the surface? 

15. Assume that the three horizontal homogeneous rock 
layers in the previous problems have densities of 
1800, 2200, and 2500 kg m -3 respectively. The lowest 
layer overlies basement with velocity 5.8 km s -1 and 
density 2700 kg m~ 3 . 

(a) Compute the reflection and transmission coeffi¬ 
cients at each interface for a plane P-wave travel¬ 
ling vertically downwards. 

(b) Calculate what fraction of the initial energy of 
the wave is transmitted into the basement. 

(c) Calculate the fraction of the initial energy carried 
in the reflection that returns to the surface from 
the basement. 

16. An incident P-wave is converted into refracted and 
reflected P- and S-waves at an interface. Calculate all 
the critical angles in the following three cases, where 
a and /3 are the P-wave and S-wave velocities, respec¬ 
tively: 


Layer 

Seismic 

wave 

Case (a) 
[km s- 1 ] 

Case (b) 
[km s- 1 ] 

Case (c) 
[km s- 1 ] 

Above interface 

a 

3.5 

4.0 

5.5 


(i 

2.0 

2.3 

3.1 

Below interface 

a 

8.5 

6.0 

7.0 


p 

5.0 

3.5 

4.0 


17. An incident P-wave is converted into refracted and 
reflected P- and S-waves at an interface that is 
inclined at 20° to the horizontal, as in the figure 
below. The respective P- and S-wave velocities are 
5 km s -1 and 3 km s -1 above the interface and 



7 km s -1 and 4 km s _1 below the interface. If the 
incident P-wave strikes the interface at an angle of 
40° to the horizontal, calculate the angles to the hori¬ 
zontal made by the reflected and refracted P- and 
S-waves. 

18. A seismic refraction survey gave the following data 
for the first arrival times at various distances from the 
shot-point. 


Distance [km] 

Time [s] 

Distance [km] 

Time [s] 

3.1 

1.912 

13.1 

6.678 

5.0 

3.043 

14.8 

7.060 

6.5 

3.948 

16.4 

7.442 

8.0 

4.921 

18.0 

7.830 

9.9 

5.908 

19.7 

8.212 

11.5 

6.288 




(a) Plot the travel-time curve for the first arrivals. 

(b) Calculate the seismic velocities of the layers. 

(c) Calculate the minimum depth to the refracting 
interface. 

(d) Calculate the critical angle of refraction for the 
interface. 

(e) Calculate the critical distance for the first arrival 
of refracted rays. 

(f) Calculate the crossover distance beyond which 
the first arrivals correspond to head waves. 

19. A seismic refraction survey is carried out over a 
layered crust with flat-lying interfaces. In one case 
the crust is homogeneous and 30 km thick with a P- 
wave velocity 6 km s -1 and overlies mantle with P- 
wave velocity of 8 km s _1 . In the other case the crust 
consists of an upper layer 20 km thick with P-wave 
velocity 6 km s -1 overlying a lower layer 10 km 
thick with P-wave velocity 5 km s _1 . The upper 
mantle P-wave velocity is again 8 km s _1 . On the 
same graph, plot the first arrival time curves for the 
two cases. What is the effect of the low-velocity 
layer on the estimation of depth to the top of the 
mantle? 

20. The table below gives up-dip and down-dip travel- 
times of P-wave arrivals for refraction profiles over an 
inclined interface. The geophones are laid out in a 
straight line passing through the alternate shot-points 
A and B, which are 2700 m apart on the profile. 

(a) Plot the travel-time curves for each shot-point. 

(b) Calculate the true velocity of the upper layer. 

(c) Calculate the apparent velocities of the layer 
below the refractor. 

(d) In which direction does the refracting interface 
dip? 

(e) What is the angle of dip of the interface? 

(f) What is the true velocity of the layer below the 
refractor? 
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Distance from 

Travel-time [s] 


(g) What are the closest distances to the refractor 
below A and B? 



shot-point [m] 

from A 

from B 

(h) What are the vertical depths to the refractor 

300 

0.139 

0.139 

below A and B? 

600 

0.278 

0.278 


900 

0.417 

0.417 


1200 

0.556 

0.556 


1500 

0.695 

0.695 


1800 

0.833 

0.833 


2100 

0.972 

0.972 


2400 

1.085 

1.111 


2700 

1.170 

1.170 


3000 

1.255 

1.223 


3300 

1.339 

1.276 


3600 

1.424 

1.329 




4 Earth's age, thermal and electrical properties 


4.1 GEOCHRONOLOGY 

4.1.1 Time 

Time is both a philosophical and physical concept. Our 
awareness of time lies in the ability to determine which of 
two events occurred before the other. We are conscious of 
a present in which we live and which replaces continually a 
past of which we have a memory; we are also conscious of 
a future, in some aspects predictable, that will replace the 
present. The progress of time was visualized by Sir Isaac 
Newton as a river that flows involuntarily at a uniform 
rate. The presumption that time is an independent entity 
underlies all of classical physics. Although Einstein’s 
Theory of Relativity shows that two observers moving rel¬ 
ative to each other will have different perceptions of time, 
physical phenomena are influenced by this relationship 
only when velocities approach the speed of light. In every¬ 
day usage and in non-relativistic science the Newtonian 
notion of time as an absolute quantity prevails. 

The measurement of time is based on counting cycles 
(and portions of a cycle) of repetitive phenomena. 
Prehistoric man distinguished the differences between day 
and night, he observed the phases of the Moon and was 
aware of the regular repetition of the seasons of the year. 
From these observations the day, month and year 
emerged as the units of time. Only after the development 
of the clock could the day be subdivided into hours, 
minutes and seconds. 

4.1.1.1 The clock 

The earliest clocks were developed by the Egyptians and 
were later introduced to Greece and from there to Rome. 
About 2000 BC the Egyptians invented the water clock 
(or clepsydra). In its primitive form this consisted of a 
container from which water could escape slowly by a 
small hole. The progress of time could be measured by 
observing the change of depth of the water in the con¬ 
tainer (using graduations on its sides) or by collecting and 
measuring the amount of water that escaped. The 
Egyptians (or perhaps the Mesopotamians) are also cred¬ 
ited with inventing the sundial. Early sundials consisted 
of devices - poles, upright stones, pyramids or obelisks - 
that cast a shadow; the passage of time was observed by 
the changing direction and length of the shadow. After 


Earth’s 

orbit 



Fig. 4.1 The sidereal day is the time taken for the Earth to rotate 
through 360° relative to a fixed star; the solar day is the time taken for a 
rotation between meridians relative to the Sun. This is slightly more 
than 360° relative to the stars, because the Earth is also orbiting the 
Sun. 

trigonometry was developed dials could be accurately 
graduated and time precisely measured. Mechanical 
clocks were invented around 1000 AD but reliably accu¬ 
rate pendulum clocks first came into use in the seven¬ 
teenth century. Accurate sundials, in which the shadow 
was cast by a fine wire, were used to check the setting and 
calibration of mechanical clocks until the nineteenth 
century. 

4.1.12 Units of time 

The day is defined by the rotation of the Earth about its 
axis. The day can be defined relative to the stars or to the 
Sun (Fig. 4.1). The time required for the Earth to rotate 
through 360° about its axis and to return to the same 
meridian relative to a fixed star defines the sidereal day. 
All sidereal days have the same length. Sidereal time must 
be used in scientific calculations that require rotational 
velocity relative to the absolute framework of the stars. 
The time required for the Earth to rotate about its axis 
and return to the same meridian relative to the Sun 
defines the solar day. While the Earth is rotating about its 
axis, it is also moving forward along its orbit. The orbital 
motion about the Sun covers 360° in about 365 days, so 
that in one day the Earth moves forward in its orbit by 
approximately 1°. To return to the solar meridian the 
Earth must rotate this extra degree. The solar day is there¬ 
fore slightly longer than the sidereal day. Solar days are 
not equal in length. For example, at perihelion the Earth 
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is moving faster forward in its orbit than at aphelion (see 
Fig. 1.2). At perihelion the higher angular rate about the 
Sun means that the Earth has to rotate through a larger 
than average angle to catch up with the solar meridian. 
Thus, at perihelion the solar day is longer than average; at 
aphelion the opposite is the case. The obliquity of the 
ecliptic causes a further variation in the length of the 
solar day. Mean solar time is defined in terms of the mean 
length of the solar day. It is used for most practical pur¬ 
poses on Earth, and is the basis for definition of the hour, 
minute and second. One mean solar day is equal to exactly 
86,400 seconds. The length of the sidereal day is approxi¬ 
mately 86,164 seconds. 

The sidereal month is defined as the time required for 
the Moon to circle the Earth and return to the celestial 
longitude of a given star. It is equal to 27.32166 (solar) 
days. To describe the motion of the Moon relative to the 
Sun, we have to take into account the Earth’s motion 
around its orbit. The time between successive alignments 
of the Sun, Earth and Moon on the same meridian is the 
synodic month. It is equivalent to 29.530 59 days. 

The sidereal year is defined as the time that elapses 
between successive occupations by the Earth of the same 
point in its orbit with respect to the stars. It is equal to 
365.256 mean solar days. Two times per year, in spring 
and autumn, the Earth occupies positions in its orbit 
around the Sun where the lengths of day and night are 
equal at any point on the Earth. The spring occurrence is 
called the vernal equinox; that in the autumn is the 
autumnal equinox. The solar year (correctly called the 
tropical year) is defined as the time between successive 
vernal equinoxes. It equals 365.242 mean solar days, 
slightly less than the sidereal year. The small difference 
(0.014 days, about 20 minutes) is due to the precession of 
the equinoxes, which takes place in the retrograde sense 
(i.e., opposite to the revolution of the Earth about the 
Sun) and thereby reduces the length of the tropical year. 

Unfortunately the lengths of the sidereal and tropical 
years are not constant but change slowly but measurably. 
In order to have a world standard the fundamental unit of 
scientific time was defined in 1956 in terms of the length 
of the tropical year 1900, which was set equal to 
31,556,925.9747 seconds of ephemeris time. Even this def¬ 
inition of the second is not constant enough for the needs 
of modern physics. Highly stable atomic clocks have been 
developed that are capable of exceptional accuracy. For 
example, the alkali metal cesium has a sharply defined 
atomic spectral line whose frequency can be determined 
accurately by resonance with a tuned radio-frequency 
circuit. This provides the physical definition of the second 
of ephemeris time as the duration of 9,192,631,770 cycles 
of the cesium atomic clock. 

Other units of time are used for specific purposes. 
Astronomers use a practical unit of time synchronized to 
the Earth's rotation. This gives a uniform timescale called 
universal time and denoted UT2; it is defined for a partic¬ 
ular year by the Royal Observatory at Greenwich, 


England. The particular ways of defining the basic units 
of time are important for analyzing some problems in 
astronomy and satellite geodesy, but for most geophysical 
applications the minor differences between the different 
definitions are negligible. 

4.1.1.3 The geological timescale 

Whereas igneous rocks are formed in discrete short-lived 
eruptions of magma, sequences of sedimentary rocks 
take very long periods of time to form. Many sedimen¬ 
tary formations contain fossils, i.e., relicts of creatures 
that lived in or near the basin in which the sediments were 
deposited. Evolution gives a particular identifiable char¬ 
acter to the fossils in a given formation, so that it is possi¬ 
ble to trace the evolution of different fossil families and 
correlate their beginnings and extinctions. These charac¬ 
teristics allow the formation of the host rock to be corre¬ 
lated with others that contain part or all of the same fossil 
assemblage, and permit the development of a scheme for 
dating sediments relative to other formations. Gradually 
a biostratigraphical timescale has been worked out, which 
permits the accurate determination of the relative age of a 
sedimentary sequence. This information is intrinsic to any 
geological timescale. 

A geological timescale combines two different types of 
information. Its basic record is a chronostratigraphical 
scale showing the relationship between rock sequences. 
These are described in detail in stratigraphical sections 
that are thought to have continuous sedimentation and 
to contain complete fossil assemblages. The boundary 
between two sequences serves as a standard of reference 
and the section in which it occurs is called a boundary 
stratotype. When ages are associated with the reference 
points, the scale becomes a geological timescale. Time is 
divided into major units called eons, which are subdi¬ 
vided into eras; these in turn are subdivided into periods 
containing several epochs. The lengths of time units in 
early timescales, before ages were determined with 
radioactive isotopes, were expressed as multiples of the 
duration of the Eocene epoch. Modern geological 
timescales are based on isotopic ages, which are cali¬ 
brated in normal time units. 

The geological timescale is constantly being revised 
and updated. Improved descriptions of important bound¬ 
aries in global stratotype sections, more refined correla¬ 
tion with other chronostratigraphical units, and better 
calibration by more accurate isotopic dating lead to fre¬ 
quent revisions. Figure 4.2 shows an example of a current 
geological timescale. 

4.1.2 Estimating the Earth's age 

Early estimates of the Earth’s age were dominated by reli¬ 
gious beliefs. Some oriental philosophers in antiquity 
believed that the world had been in existence for millions of 
years. Yet, western thought on this topic was dominated 
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Fig. 4.2 A simplified version 
of the geological timescale of 
Harland eta!., 1990. 


Age (Ma) Age (Ma) Age (Ma) 



for centuries by the tenets of the Jewish and Christian 
faiths. Biblical estimates of the world’s age were made on 
the basis of genealogy by adding up the lengths of lifetimes 
and generations mentioned in the Old Testament. Some 
estimates also incorporate information from other ancient 
scriptures or related sources. The computed age of 
Creation invariably gave an age of the Earth less than 
10,000 yr. 

The best-known biblical estimate of the date of 
Creation was made by James Ussher (1581-1656), an Irish 
archbishop. His analysis of events recorded in the Old 
Testament and contemporary ancient scrolls led Ussher to 
proclaim that the exact time of Creation was at the begin¬ 
ning of night on October 22, in the year 4004 BC, which 
makes the age of the Earth approximately 6000 yr. Other 
biblical scholars inferred similar ages. This type of “cre¬ 
ationist” age estimate is still favored by many fundamen¬ 
talist Christians, whose faith is founded on literal 
interpretation of the Bible. However, biblical scholars 
with a more broadly based faith recognize age estimates 
based upon scientific methodology and measurement. 

In the late nineteenth century the growth of natural 
philosophy (as physics was then called) fostered calcula¬ 
tions of the Earth’s age from physical properties of the 
solar system. Among these were estimates based on the 
cooling of the Sun, the cooling of the Earth, and the slow 
increase in the Earth-Moon distance. Chemists tried to 
date the Earth by establishing the time needed for the seas 
to acquire their salinity, while geologists conjectured how 
long it would take for sediments and sedimentary rocks to 
accumulate. 


4.1.2.1 Cooling of the Sun 

Setting aside the biblical “sequence” chronicled in the 
book of Genesis, modern-day philosophers opine that the 
Earth cannot be older than the Sun. Cooling of the Sun 
takes place by continuous radiation of energy into space. 
The amount of solar energy falling on a square meter per 
second at the Earth’s distance from the Sun (1 AU) is 
called the solar constant; it equals 1360W m -2 . The 
amount of energy lost by the Sun per second is obtained 
by multiplying this value by the surface area of a sphere 
whose radius is one astronomical unit. This simple calcu¬ 
lation shows that the Sun is losing energy at the rate 
of 3.83 X 10 26 W. In the nineteenth century, before the dis¬ 
covery of radioactivity and nuclear reactions, the source 
of this energy was not known. A German scientist, 
H. L. F. von Helmholtz, suggested in 1856 that it might 
result from the change of potential energy due to gravita¬ 
tional condensation of the Sun from an originally more 
distended body. The condensational energy E s of a mass 
M s of uniform density and radius R s is given by 

3 K 

£’ s = |G-^ = 2. 28xl ° 41 J (4.1) 

In this equation G is the gravitational constant (see 
Section 2.2.1). The factor 3/5 in the result arises from the 
assumption of a uniform density distribution inside the 
Sun. Dividing the condensational energy by the rate of 
energy loss gives an age of 19 million years for the Sun. 
Allowing for the increase in density towards the center 
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causes an approximately threefold increase in the gravita¬ 
tional energy and the inferred age of the Sun. 

4.12.2 Cooling of the Earth 

In 1862 William Thomson, who later became Lord 
Kelvin, examined the cooling of the Sun in more detail. 
Unaware (although suspicious) of other sources of solar 
energy, he concluded that gravitational condensation 
could supply the Sun's radiant energy for at least 10 
million years but not longer than 500 million years 
(500 Ma). In the same year he investigated the cooling 
history of the Earth. Measurements in deep wells and 
mines had shown that temperature increases with depth. 
The rate of increase, or temperature gradient, was known 
to be variable but seemed to average about 1°F for every 
50 ft (0.036 °C m _1 ). Kelvin inferred that the Earth was 
slowly losing heat and assumed that it did so by the 
process of conduction alone (Section 4.2.4). This enabled 
him to deduce the Earth’s age from a solution of the one- 
dimensional equation of heat conduction (see Eq. 4.57 
and Box 4.2), which relates the temperature T at time t 
and depth z to physical properties of the cooling body, 
such as its density (p), specific heat (c) and thermal con¬ 
ductivity (k). 

These parameters are known for particular rock types, 
but Kelvin had to adopt generalized values for the entire 
Earth. The model assumes that the Earth initially had a 
uniform temperature T 0 throughout, and that it cooled 
from the outside leaving the present internal temperatures 
higher than the surface temperature. After a time t has 
elapsed the temperature gradient at the surface of the 
Earth is given by 



Kelvin assumed an initial temperature of 7000°F 
(3871°C, 4144 K) for the hot Earth, and contemporary 
values for the surface temperature gradient and thermal 
parameters. The calculation for t yielded an age of about 
100 Ma for the cooling Earth. 

4.1.2.3 Increase of the Earth-Moon separation 

The origin of the Moon is still uncertain. George FI. 
Darwin, son of the more famous Charles Darwin and 
himself a pioneer in tidal theory, speculated that it was 
torn from the Earth by rapid rotation. Like other classical 
theories of the Moon’s origin (e.g., capture of the Moon 
from elsewhere in the solar system, or accretion in Earth 
orbit) the theory is flawed. In 1898 Darwin tried to 
explain the Earth’s age from the effects of lunar tidal fric¬ 
tion. The tidal bulges on the Earth interact with the 
Moon’s gravitation to produce a decelerating torque that 
slows down the Earth’s rotation and so causes an increase 
in the length of the day. The equal and opposite reaction 


is a torque exerted by the Earth on the Moon’s orbit that 
increases its angular momentum. As explained in Section 
2.3.4.2, this is achieved by an increase in the distance 
between the Moon and the Earth and a decrease in the 
rotation rate of the Moon about the Earth, which 
increases the length of the month. The Earth's rotation 
decelerates more rapidly than that of the Moon, so that 
eventually the angular velocities of the Earth and Moon 
will be equal. In this synchronous state the day and 
month will each last about 47 of our present days and the 
Earth-Moon distance will be about 87 Earth radii; the 
separation is presently 60.3 Earth radii. 

Similar reasoning suggests that earlier in Earth’s 
history, when the Moon was much closer to the Earth, 
both bodies rotated faster so that an earlier synchronous 
state may be conjectured. The day and month would each 
have lasted about 5 of our present hours and the 
Earth-Moon distance would have been about 2.3 Earth 
radii. Flowever, at this distance the Moon would be inside 
the Roche limit, about 3 Earth radii, at which the Earth’s 
gravitation would tear it apart. Thus, it is unlikely that 
this condition was ever realized. 

Darwin calculated the time needed for the Earth and 
Moon to progress from an initially unstable close rela¬ 
tionship to their present separation and rotation speeds, 
and concluded that a minimum of 56 Ma would be 
needed. This provided an independent estimate of the 
Earth's age, but it is unfortunately as flawed in its underly¬ 
ing assumptions as other models. 

4.12.4 Oceanic salinity 

Several determinations of the age of the Earth have been 
made on the basis of the chemistry of sea water. The rea¬ 
soning is that salt is picked up by rivers and transported 
into lakes and seas; evaporation removes water but the 
vapor is fresh, so the salt is left behind and accumulates 
with time. If the accumulation rate is measured, and if the 
initial salt concentration was zero, the age of the sea and, 
by inference, the Earth can be calculated by dividing the 
present concentration by the accumulation rate. 

Different versions of this method have been investi¬ 
gated. The most noted is that of an Irish geologist, John 
Joly, in 1899. Instead of measuring salt he used the con¬ 
centration of a pure element, sodium. Joly determined the 
total amount of sodium in the oceans and the annual 
amount brought in by rivers. He concluded that the prob¬ 
able maximum age of the Earth was 89 Ma. Later esti¬ 
mates included corrections for possible sodium losses and 
non-linear accumulation but gave similar ages less than 
about 100 Ma. 

The principal flaw in the chemical arguments is the 
assumption that sodium accumulates continuously in the 
ocean. In fact, all elements are withdrawn from the ocean 
at about the same rate as they are brought in. As a result, 
sea water has a chemically stable composition, which is 
not changing significantly. The chemical methods do not 
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measure the age of the ocean or the Earth, but only the 
average length of time that sodium resides in the seas 
before it is removed. 

4. 1.2.5 Sedimentary accumulation 

Not to be outdone by the physicists and chemists, late 
nineteenth century geologists tried to estimate Earth’s age 
using stratigraphical evidence for the accumulation of 
sediments. The first step involved determining the thick¬ 
nesses of sediment deposited during each unit of geologi¬ 
cal time. The second step was to find the corresponding 
sedimentation rate. When these parameters are known, 
the length of time represented in each unit can be calcu¬ 
lated. The age of the Earth is the sum of these times. 

The geological estimates are fraught with complica¬ 
tions. Sediments with a silicate matrix, such as sandstones 
and shales, are deposited mechanically, but carbonate 
rocks form by precipitation from sea water. To calculate the 
mechanical rate of sedimentation the rate of input has to 
be known. This requires knowing the area of the deposi- 
tional basin, the area supplying the sediments and its rate 
of land erosion. Rates used in early studies were largely 
intuitive. The calculations for carbonate rocks required 
knowing the rate of solution of calcium carbonate from 
the land surfaces, but could not correct for the variation of 
solubility with depth in the depositional basin. The number 
of unknown or crudely known parameters led to numerous 
divergent geological estimates for the Earth’s age, ranging 
from tens of millions to hundreds of millions of years. 

4.1.3 Radioactivity 

In 1896 a French physicist, Henri Becquerel, laid a sample 
of uranium ore on a wrapped, undeveloped photographic 
plate. After development the film showed the contour of 
the sample. The exposure was attributed to invisible rays 
emitted by the uranium sample. The phenomenon, which 
became known as radioactivity, provides the most reliable 
methods yet known of dating geological processes and 
calculating the age of the Earth and solar system. To 
appreciate radioactivity we must consider briefly the 
structure of the atomic nucleus. 

The nucleus of an atom contains positively charged 
protons and electrically neutral neutrons. The electrostatic 
Coulomb force causes the protons to repel each other. It 
decreases as the inverse square of the separation (see 
Section 4.3.2) and so can act over distances that are large 
compared to the size of the nucleus. An even more power¬ 
ful nuclear force holds the nucleus together. This attractive 
force acts between protons, between neutrons, and between 
proton and neutron. It is only effective at short distances, 
for a particle separation less than about 3 X 10“ 15 m. 

Suppose that the nucleus of an atom contains Z 
protons and is surrounded by an equal number of nega¬ 
tively charged electrons, so that the atom is electrically 
neutral; Z is the atomic number of the element and defines 


its place in the periodic table. The number N of neutrons 
in the nucleus is its neutron number, and the total number 
A of protons and neutrons is the mass number of the 
atom. Atoms of the same element with different neutron 
numbers are called isotopes of the element. For example, 
uranium contains 92 protons but may have 142, 143, or 
146 neutrons. The different isotopes are distinguished by 
appending the mass number to the chemical symbol, 
giving 234 U, 235 U and 238 U. 

The Coulomb force of repulsion acts between every 
pair of protons in a nucleus, while the short-range nuclear 
force acts only on nearby protons and neutrons. To avoid 
flying apart due to Coulomb repulsion all nuclei with 
atomic number Z greater than about 20 have an excess of 
neutrons (A>Z). This helps to dilute the effects of the 
repulsion-producing protons. However, nuclei with Z> 
83 are unstable and disintegrate by radioactive decay. This 
means that they break up spontaneously by emitting ele¬ 
mentary particles and other radiation. 

At least 28 distinct elementary particles are known to 
nuclear physics. The most important correspond to the 
three types of radiation identified by early investigators 
and called a-, [3-, and y-rays. An a-ray (or a-particle) is a 
helium nucleus that has been stripped of its surrounding 
electrons; it is made up of two protons and two neutrons 
and so has atomic number 2 and mass number 4. A (3- 
particle is an electron. Some reactions emit additional 
energy in the form of y-rays, which have a very short 
wavelength and are similar in character to x-rays. 

4.1.3.1 Radioactive decay 

A common type of radioactive decay is when a neutron n 0 
in the nucleus of an atom spontaneously changes to a 
proton, p + , and a (3-particle (non-orbital electron). The 
(3-particle is at once ejected from the nucleus along with 
another elementary particle called an antineutrino, v, 
which has neither mass nor charge and need not concern 
us further here. The reaction can be written 

n 0 =>P , +P“+v (4.3) 

Radioactive decay is a statistical process. It is custom¬ 
ary to call the nucleus that decays the parent and the 
nucleus after decay the daughter. It is not possible to say in 
advance which nucleus will spontaneously decay. But the 
probability that any one will decay per second is a con¬ 
stant, called the decay constant or decay rate. The number 
of disintegrations per second is called the activity of the 
nucleus. The derived unit of radioactivity, corresponding 
to one disintegration per second, is the becquerel (Bq). 

Statistical behavior is really only applicable to large 
numbers, but the process of radioactive decay can be illus¬ 
trated with a simple example. Suppose we start with 1000 
nuclei, and the chance per second of a decay is 1 in 10, 
or 0.1. In the first second 10% of the parent nuclei 
spontaneously decay (i.e., 100 decays take place); the 
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number of parent nuclei is reduced to 900. The probability 
that any parent decays in the next second is still 1 in 10, so 
90 further decays can be expected. Thus after 2 seconds the 
number of parent nuclei is reduced to 810, after 3 seconds 
to 729, and so on. The total number of parent nuclei con¬ 
stantly gets smaller but in principle it never reaches zero, 
although after a long time it approaches zero asymptoti¬ 
cally. The decay is described by an exponential curve. 

If the decay rate is equal to A, then in a short time- 
interval dr the probability that a given nucleus will decay 
is A dr; if at any time we have P parent nuclei the number 
that decay in the following interval dr is P( A dr). The 
change dP in the number of P parent nuclei in a time 
interval dr due to spontaneous decays is 

dP = -APdr 
dP 

f=-AP (4.4) 

which has the solution 



P = P 0 e -A ' (4.5) 

Equation (4.5) describes the exponential decay of the 
number of parent nuclides, starting from an initial number 
P Q . While the number of parent nuclides diminishes, the 
number of daughter nuclides D increases (Fig. 4.3). D is 
the difference between P and P 0 and so is given by 

2> = P 0 -P = P 0 (l-e—■*0 (4.6) 


The original amount P 0 of the parent nuclide is not 
known; a rock sample contains a residual amount P of 
the parent nuclide and an amount D of the daughter 
product. Eliminating the unknown P 0 from Eq. (4.5) and 
Eq. (4.6) gives 

P = P(e Af — 1) (4.7) 


Equation (4.4) shows that the number of nuclear disin¬ 
tegrations per second - the activity of the nucleus - is pro¬ 
portional to the number of parent nuclei. The activity A 
at any given time is thus related to the initial activity A 0 by 
an equation similar to Eq. (4.5) 


A = — — — \ p P -A/ 

a dt Af 0 e 
A = A 0 e~ x ' 


(4.8) 


The experimental description of radioactive decay by 
Ernest (later Lord) Rutherford and Frederick Soddy in 
1902 was based on the observations of times needed for the 
activity of radioactive materials to decrease by steps of 
one-half. This time is known as the half-life of the decay. In 
the first half-life the number of parent nuclides decreases 
to a half, in the second half-life to a quarter, in the third to 
an eighth, etc. The number of daughter nuclides increases 
in like measure, so that the sum of parent and daughter 
nuclides is always equal to the original number P Q . Letting 


Fig. 4.3 Exponential decrease of the number of parent nuclides and 
the corresponding growth of the number of daughter nuclides in a 
typical radioactive decay process. 


P/P 0 equal 1/2 in Eq. (4.5) we get the relationship between 
half-life t m and decay constant A: 



(4.9) 


The decay rates and half-lives are known for more than 
1700 radioactive isotopes. Some are only produced in 
nuclear explosions and are so short-lived that they last only 
a fraction of a second. Other short-lived isotopes are pro¬ 
duced by collisions between cosmic rays and atoms in the 
upper atmosphere and have short half-lives lasting minutes 
or days. A number of naturally occurring isotopes have 
half-lives of thousands of years (kiloyear, ka), millions of 
years (megayear, Ma) or billions of years (gigayear, Ga), 
and can be used to determine the ages of geological events. 


4.1.4 Radiometric age determination 

Each age-dating scheme involves precise measurement of 
the concentration of an isotope. This is usually very 
small. If the radioactive decay has advanced too far, the 
resolution of the method deteriorates. The best results for 
a given isotopic decay scheme are obtained for ages less 
than a few half-lives of the decay. The decay constants 
and half-lives of some radioactive isotopes commonly 
used in dating geological events are listed in Table 4.1 and 
illustrated in Fig. 4.4. Historical and archeological arti¬ 
facts can be dated by the radioactive carbon method. 


4.1.4.1 Radioactive carbon 

The Earth is constantly being bombarded by cosmic radi¬ 
ation from outer space. Collisions of cosmic particles with 
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Table 4.1 Decay constants and half-lives of some 
naturally occurring, radioactive isotopes commonly used in 
geochronology 


Parent 

isotope 

Daughter 

isotope 

Decay constant 
[10 10 yr 4 ] 

Half-life 

[Ga] 

40 K 

89.5% 40 Ca 
10.5% 40 Ar 

5.543 

1.25 

87 Rb 

87 Sr 

0.1420 

48.8 

147 Sm 

143 Nd 

0.0654 

106.0 

232 Th 

2°8p b 

0.4948 

14.01 

235U 

20 7 p b 

9.8485 

0.704 

238u 

2° 6 Pb 

1.5513 

4.468 


0 1 2 3 4 5 



Age (Ga) 


Fig.4.4 Normalized radioactive decay curves of some important 

isotopes for dating the Earth and solar system. The arrows 

indicate the respective half-lives in 10 9 yr(data source: Dalrymple, 1991). 


atoms of oxygen and nitrogen in the Earth’s atmosphere 
produce high-energy neutrons. These in turn collide with 
nitrogen nuclei, transforming them into 14 C, a radioactive 
isotope of carbon. 14 C decays by (3-particle emission to 
14 N with a half-life of 5730 yr. The production of new 14 C 
is balanced by the loss due to decay, so that a natural equi¬ 
librium exists. Photosynthesis in animals and plants 
replenishes living tissue using carbon dioxide, which con¬ 
tains a steady proportion of 14 C. When an organism dies, 
the renewal stops and the residual 14 C in the organism 
decays radioactively. 

The radioactive carbon method is a simple decay analy¬ 
sis based on Eq. (4.5). The remaining proportion P of 14 C 
is measured by counting the current rate of (3-particle 
activity, which is proportional to P. This is compared to 
the original equilibrium concentration P Q . The time since 
the onset of decay is calculated by solving Eq. (4.5) using 
the decay rate for 14 C (A = 1.21 X 10 -4 yr -1 ). 


Sr mass spectrum 



to pump 


Fig. 4.5 Schematic design of a mass spectrometer, and hypothetical 
mass spectrum for analysis of strontium isotopes (after York and 
Farquhar, 1972). 

The radioactive carbon method has been valuable in 
dating events in the Holocene epoch, which covers the last 
10,000 yr of geological time, as well as events related to 
human prehistory. Unfortunately, human activity has dis¬ 
turbed the natural equilibrium of the replenishment and 
decay scheme. The concentration of 14 C in atmospheric 
carbon has changed dramatically since the start of the 
industrial age. Partly this is due to the combustion of 
fossil fuels like coal and oil as energy sources; they have 
long lost any 14 C and dilute its presence in atmospheric 
carbon. In the past half-century atmospheric testing of 
nuclear weapons doubled the concentration of 14 C in the 
atmosphere. Older materials may still be dated by the 14 C 
method, although natural fluctuations in P 0 must be 
taken into account. These are due to variations in inten¬ 
sity of the geomagnetic field, which acts as a partial shield 
against cosmic radiation. 

4.1.4.2 The mass spectrometer 

In the years before World War II, physicists invented the 
mass spectrometer, an instrument for measuring the mass 
of an ion. The instrument was further refined during the 
development of the atomic bomb. After the war it was 
adopted into the earth sciences to determine isotopic 
ratios and became a vital part of the process of isotopic 
age determination. 

The mass spectrometer (Fig. 4.5) utilizes the different 
effects of electrical and magnetic fields on a charged par¬ 
ticle, or ion. First, the element of interest is extracted 
from selected mineral fractions or from pulverized whole 
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rock. The extract is purified chemically before being 
introduced into the mass spectrometer, where it is ionized. 
For the analysis of a gas like argon, bombardment by a 
stream of electrons may be used. Solid elements, such as 
potassium, rubidium, strontium or uranium, are vapor¬ 
ized by depositing a sample on an electrically heated fila¬ 
ment, or by heating with a high-energy laser beam so that 
the sample is vaporized directly. The latter process is 
known as “laser ablation.” The ions then enter an evacu¬ 
ated chamber and pass through an “ion gun,” where they 
are accelerated by an electrical field and filtered by a 
velocity selector. This device uses electrical and magnetic 
fields at right angles to the ion beam to allow only the 
passage of ions with a selected velocity v. The ion beam is 
next subjected to a powerful uniform magnetic field B at 
right angles to its direction of motion. An ion with charge 
q experiences a Lorentz force (see Section 5.2.4) equal to 
(qvB) perpendicular to its velocity and to the magnetic 
field. Its trajectory is bent to form a circular arc of radius 
r; the centrifugal force on the particle is equal to ( mv 2 lr). 
The curved path focuses the beam on a collector device 
that measures the intensity of the incident beam. The 
radius of the circular arc is given by equating the Lorentz 
and centrifugal forces: 


sensitive, versatile instrument for determining trace 
element abundances with a detection level in the region of 
parts per trillion. It can be used for analyzing both liquid 
and solid samples, and has many applications in fields 
outside the earth and environmental sciences. 

An important development in the field of mass spec¬ 
trometry is the ion microprobe mass analyzer. In conven¬ 
tional mass spectrometry the analysis of a particular 
element is preceded by separating it chemically from a 
rock sample. The study of individual minerals or the vari¬ 
ation of isotopic composition in a grain is very difficult. 
The ion microprobe avoids contamination problems that 
can arise during the chemical separation and its high res¬ 
olution permits isotopic analysis of very small volumes. 
Before a sample is examined in the instrument its surface 
is coated with gold or carbon. A narrow beam of nega¬ 
tively charged oxygen ions, about 3-10 p,m wide, is 
focussed on a selected grain. Ions are sputtered out of the 
surface of the mineral grain by the impacting ion beam, 
accelerated by an electrical field and separated magneti¬ 
cally as in a conventional mass spectrometer. The ion 
microprobe allows description of isotopic concentrations 
and distributions in the surface layer of the grain and pro¬ 
vides isotopic ratios. The instrument allows isotopic 
dating of individual mineral grains in a rock, and is espe¬ 
cially well suited to the analysis of very small samples. 


The focal point of the path is determined by the mass 
of the ion and the strength of the magnetic field B. In the 
case of a strontium analysis, the ion beam leaving the ion 
gun contains the four isotopes 88 Sr, 87 Sr, 86 Sr and 84 Sr. 
The beam splits along four paths, each with a different 
curvature. The magnetic field is adjusted so that only one 
isotope at a time falls on the collector. The incident 
current is amplified electronically and recorded. A spec¬ 
trum is obtained with peaks corresponding to the inci¬ 
dence of individual isotopes (Fig. 4.5, inset). The 
intensity of each peak is proportional to the abundance 
of the isotope, which can be measured with a precision of 
about 0.1%. However, the relative peak heights give the 
relative abundances of the isotopes to a precision of 
about 0.001%. 

Significant improvements in the design of mass spec¬ 
trometer systems have increased their sensitivity further. 
Following laser ablation, the vaporized sub-micrometer- 
sized particles are mixed with an argon plasma. This is a 
gas consisting of positively charged argon ions and an 
equal number of unbound electrons at a temperature of 
6000-10,000 K. The plasma ionizes the particles, which 
are extracted into a high vacuum before passing through 
an ion gun into the mass spectrometer. A drawback of the 
single-collector design described above is that it is vulner¬ 
able to fluctuations in the beam intensity. This problem 
has been overcome with the introduction of multiple- 
collector instruments, which allow simultaneous mea¬ 
surement of the split isotopic beams. The inductively 
coupled plasma mass spectrometer (ICP-MS) is a highly 


4.1.4.3 Rubidium-strontium 

The use of a radioactive decay scheme as given by Eq. 
(4.7) assumes that the amount of daughter isotope in a 
sample has been created only by the decay of a parent 
isotope in a closed system. Usually, however, an unknown 
initial amount of the daughter isotope is present, so that 
the amount measured is the sum of the initial concentra¬ 
tion D 0 and the fraction derived from decay of the parent 
P 0 . The decay equation is modified to 

D = D 0 + P(e x ' — 1) (4.11) 

The need to know the amount of initial daughter 
isotope D 0 is obviated by the analytical method, which 
makes use of a third isotope of the daughter element to 
normalize the concentrations of daughter and parent iso¬ 
topes. The rubidium-strontium method illustrates this 
technique. 

Radioactive rubidium ( 87 Rb) decays by (3-particle 
emission to radiogenic strontium ( 87 Sr). The non-radi- 
ogenic, stable isotope 86 Sr has approximately the same 
abundance as the radiogenic product 87 Sr, and is chosen 
for normalization. Writing 87 Rb for P, 87 Sr for D in Eq. 
(4.11) and dividing both sides by 86 Sr gives 





(e A( — 


1 ) 


(4.12) 


In a magmatic rock, the isotopic ratio ( 87 Sr/ 86 Sr) 0 is 
uniform in all minerals precipitated from the melt because 
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Fig. 4.6 Rb/Sr isochron for the Uivak gneisses from eastern Labrador. 
The slope of the isochron gives an age of 3.622 ± 0.072 Ga (after Hurst 
eta!., 1975). 


the isotopes are chemically identical (i.e., they have the 
same atomic number). However, the proportion of the 
different elements Rb and Sr varies from one mineral to 
another. Equation (4.12) can be compared with the equa¬ 
tion of a straight line, such as 

y = y 0 + mx (4.13) 

The ratio 87 Sr/ 86 Sr is the dependent variable, y, and the 
ratio ( 87 Rb/ 86 Sr) is the independent variable, x. If we 
measure the isotopic ratios in several samples of the rock 
and plot the ratio 87 Sr/ 86 Sr against the ratio 87 Rb/ 86 Sr, we 
get a straight line, called an isochron (Fig. 4.6). The inter¬ 
cept with the ordinate axis gives the initial ratio of the 
daughter isotope. The slope ( m ) of the line gives the age of 
the rock, using the known decay constant A: 

t = jln( 1 + m) = 7.042 X 10 10 ln( 1 + m ) (4.14) 

Because of its long half-life of 48.8 Ga (Fig. 4.4) the 
Rb-Sr method is well suited for dating very old events in 
Earth’s history. It has been used to obtain the ages of 
meteorites and lunar samples, as well as some of the oldest 
rocks on Earth. For example, the slope of the Rb-Sr 
isochron in Fig. 4.6 yields an age of 3.62 Ga for the Early 
Precambrian (Archean) Uivak gneisses from eastern 
Fabrador. 

The Rb-Sr and other methods of isotopic dating can 
be applied to whole rock samples or to individual miner¬ 
als separated from the rock. The decay equation applies 
only to a closed system, i.e., to rocks or minerals which 
have undergone no loss or addition of the parent or 
daughter isotope since they formed. A change is more 
likely in a small mineral grain than in the rock as a 
whole. Individual mineral isochrons may express the age 


of postformational metamorphism, while the whole rock 
isochron gives an older age. On the other hand, spurious 
ages may result from whole rock analyses if samples of 
different origin, and hence differing composition, are 
inadvertently used to construct an isochron. 

4.1.4.4 Potassium-argon 

For several reasons the potassium-argon (K-Ar) method 
is probably the age-dating technique most commonly used 
by geologists. The parent isotope, potassium, is common 
in rocks and minerals, while the daughter isotope, argon, 
is an inert gas that does not combine with other elements. 
The half-life of 1250 Ma (1.25 Ga) is very convenient. On 
the one hand, the Earth’s age is equal to only a few half- 
lives, so radiogenic 40 K is still present in the oldest rocks; 
on the other hand, enough of the daughter isotope 40 Ar 
accumulates in 10 4 yr or so to give fine resolution. In the 
late 1950s the sensitivity of mass spectrometers was 
improved by constructing instruments that could be pre¬ 
heated at high temperature to drive off contaminating 
atmospheric argon. This made it possible to use the K-Ar 
method for dating lavas as young as a few million years. 

Radioactive 40 K constitutes only 0.01167% of the K in 
rocks. It decays in two different ways: (a) by (3-particle 
emission to 40 Ca., 0 with decay rate A Ca = 4.962 X 10 -10 yr -1 
and (b) by electron capture to 40 Ar ls with decay rate 
A Ar = 0.581 X 10 -10 yr 1 . The combined decay constant 
(A=A C a + A A r) is equal to 5.543 XlO -10 yr -1 . The decay 
schemes are, respectively: 

(a) 40 K 19 ^ 40 Ca 20 + [3- 

(b) 40 K 19 + e => 40 Ar lg (4.15) 

Electron capture by a nucleus is more difficult and 
rarer than p-particle emission, so the decay of 40 K ig to 
40 Ca 20 is more common than the formation of 40 Ar lg . The 
ratio of electron capture to P-particle decay, A Ar /A Ca , is 
called the branching ratio ; it equals 0.117. Thus, only the 
fraction A Ar /(A Ar + A Ca ), or 10.5%, of the initial radioac¬ 
tive potassium decays to argon. The initial amount of 
radiogenic 40 Ca usually cannot be determined, so the 
decay to Ca is not used. Allowing for the branching ratio, 
the K-Ar decay equation is analogous to Eq. (4.7) with 
40 Ar for the accumulated amount of the daughter 
product D and 40 K for the residual amount of the parent 
product P: 

40 Ar = . A f Lr . 40 K(e A ' - 1) 

A Ar + A Ca 

= 0.1048 40 K(e A( - 1) (4.16) 

The potassium-argon method is an exception to the 
need to use isochrons. It is sometimes called an accumula¬ 
tion clock, because it is based on the amount of 40 Ar that 
has accumulated. It involves separate measurements of 
the concentrations of the parent and daughter isotopes. 
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The amount of 40 K is a small but constant fraction 
(0.01167%) of the total amount of K, which can be mea¬ 
sured chemically. The 40 Ar is determined by mixing with a 
known amount of another isotope 38 Ar before being 
introduced into the mass spectrometer. The relative abun¬ 
dance of the two argon isotopes is measured and the con¬ 
centration of 40 Ar is found using the known amount of 
38 Ar. By re-ordering Eq. (4.16) and substituting for the 
decay constant, the age of the rock is obtained from the 
K-Ar age equation: 


e A '= 1 +9.54 


40 Ar 

4° k 


r= 1.804 X 10 9 ln(l +9.54^) (4 - 17) 

The 40 Ar in a molten rock easily escapes from the melt. It 
may be assumed that all of the radiogenic 40 Ar now 
present in a rock has formed and accumulated since the 
solidification of the rock. The method works well on 
igneous rocks that have not been heated since they 
formed. It cannot be used in sedimentary rocks that 
consist of the detritus of older rocks. Often it is unsuc¬ 
cessful in metamorphic rocks, which may have compli¬ 
cated thermal histories. A heating phase may drive out the 
argon, thereby re-setting the accumulation clock. This 
problem limits the usefulness of the K-Ar method for 
dating meteorites (which have a fiery entry into Earth’s 
atmosphere) and very old terrestrial rocks (because of 
their unknown thermal histories). The K-Ar method can 
be used for dating lunar basalts, as they have not been 
reheated since their formation. 
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Fig. 4.7 (a) Hypothetical age spectrum and (b) 40 Ar/ 39 Ar isochron for a 
sample that has experienced no secondary heating; (c) hypothetical age 
spectrum and (d) 40 Ar/ 39 Ar isochron for a sample that was reheated but 
retains some original argon (after Dalrymple, 1991). 


To determine this fraction a control sample of known 
age is irradiated at the same time. By monitoring the 
change in isotopic ratios in the control sample the frac¬ 
tion of 39 K nuclei converted to 39 Ar can be deduced. The 
age equation is similar to that given in Eq. (4.17) for the 
K-Ar method. However, 39 Ar replaces 40 K and an empir¬ 
ical constant J replaces the constant 9.54. The value of J 
is found from the control sample whose age is known. The 
40 Ar/ 39 Ar age equation is 


4.1.4.5 Argon-argon 

Some uncertainties related to post-formational heating of 
a rock are overcome in a modification of the K-Ar 
method that uses the 40 Ar/ 39 Ar isotopic ratio. The 
method requires conversion of the 39 K in the rock to 39 Ar. 
This is achieved by irradiating the sample with fast neu¬ 
trons in an atomic reactor. 

The terms slow and fast refer to the energy of the 
neutron radiation. The energy of slow neutrons is compa¬ 
rable to their thermal energy at room temperature; they are 
also referred to as thermal neutrons. Slow neutrons can be 
captured and incorporated into a nucleus, changing its size 
without altering its atomic number. The capture of slow 
neutrons can increase the size of unstable uranium nuclei 
beyond a critical value and initiate fission. In contrast, fast 
neutrons act on a nucleus like projectiles. When a fast 
neutron collides with a nucleus it may eject a neutron or 
proton, while itself being captured. If the ejected particle is 
another neutron, no effective change results. But if the 
ejected particle is a proton (with a (3-particle to conserve 
charge), the atomic number of the nucleus is changed. For 
example, bombarding 39 K nuclei in a rock sample with fast 
neutrons converts a fraction of them to 39 Ar. 


f= 1.804 X 10 9 ln(l (4 - 18) 

In the 40 Ar/ 39 Ar method the sample is heated progres¬ 
sively to drive out argon at successively higher tempera¬ 
tures. The 40 Ar/ 39 Ar isotopic ratio of the argon released 
at each temperature is determined in a mass spectrome¬ 
ter. The age computed for each increment is plotted 
against the percentage of Ar released. This yields an age 
spectrum. If the rock has not been heated since it was 
formed, the argon increments given out at each heating 
stage will yield the same age (Fig. 4.7a). An isochron can 
be constructed as in the Rb-Sr method by measuring the 
abundance of a non-radiogenic 36 Ar fraction and com¬ 
paring the isotopic ratios 40 Ar/ 36 Ar and 39 Ar/ 36 Ar. In an 
unheated sample all points fall on the same straight line 
(Fig. 4.7b). 

If the rock has undergone post-formational heating, 
the argon formed since the heating is released at lower 
temperatures than the original argon (Fig. 4.7c). It is not 
certain why this is the case. The argon probably passes out 
of the solid rock by diffusion, which is a thermally acti¬ 
vated process that depends on both the temperature and 
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Fig. 4.8 40 Ar/ 39 Ar age spectra for three samples of melt rocks from 
the Chicxulub Cretaceous-Tertiary impact crater (after Swisher eta!., 
1992). 


the duration of heating. Unless the post-formational 
heating is long and thorough, only the outside parts of 
grains may be forced to release trapped argon, while the 
argon located in deeper regions of the grains is retained. 
In a sample that still contains original argon the results 
obtained at high temperatures form a plateau of consis¬ 
tent ages (Fig. 4.7c) from which the optimum age and its 
uncertainty may be calculated. On an isochron diagram 
reheated points deviate from the straight line defined by 
the high-temperature isotopic ratio (Fig. 4.7d). 

The high precision of 40 Ar/ 39 Ar dating is demon¬ 
strated by the analysis of samples of melt rock from the 
Chicxulub impact crater in the Yucatan peninsula of 
Mexico. The crater is the favored candidate for the impact 
site of a 10 km diameter meteorite which caused global 
extinctions at the close of the Cretaceous period. Laser 
heating was used to release the argon isotopes. The age 
spectra of three samples show no argon loss (Fig. 4.8). 
The plateau ages are precisely defined and give a weighted 
mean age of 64.98 ± 0.05 Ma for the impact. This agrees 
closely with the 65.01 ± 0.08 Ma age of tektite glass found 
at a Cretaceous-Tertiary boundary section in Haiti, and 
so ties the impact age to the faunal extinctions. 

The age spectrum obtained in Ar-Ar dating of the 
Menow meteorite is more complicated and shows the 
effects of ancient argon loss. Different ages are obtained 
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Fig. 4.9 40 Ar/ 39 Ar age spectrum for the Menow meteorite, showing a 
plateau at4.48±0.06Ga (afterTurneref a/., 1978). 

during incremental heating below about 1200 °C (Fig. 4.9). 
The plateau ages above this temperature indicate a mean 
age of 4.48 ±0.06 Ga. The shape of the age spectrum sug¬ 
gests that about 25% of the Ar was lost much later, about 
2.5 Ga ago. 

4.1.4.6 Uranium-lead: the concordia-discordia diagram 

Uranium isotopes decay through a series of intermediate 
radioactive daughter products, but eventually they result 
in stable end-product isotopes of lead. Each of the decays 
is a multi-stage process but can be described as though it 
had a single decay constant. We can describe the decay of 
238 U to 206 Pb by 

206pk 

238u= eW - 1 (4.19) 

Likewise the decay of 235 U to 207 Pb can be written 

23^ = eW -' (4.20) 

The 235 U and 238 U isotopes have well known decay 
constants: A 235 = 9.8485 X 10" 10 yr -1 , A 238 = 1.55 1 25 X 
10 -1 ° yr -1 . A graph of the 206 Pb/ 238 U ratio against the 
207 Pb/ 235 U ratio is a curve, called the concordia line (Fig. 
4.10). All points on this line satisfy Eqs. (4.19) and (4.20) 
simultaneously. The concordia line has a curved shape 
because the decay rates of the uranium isotopes, A,, 38 and 
A 235 respectively, are different. Calculations of the iso¬ 
topic ratios given by Eqs. (4.19) and (4.20) defining the 
concordia line for the past 5 Ga are listed in Table 4.2. 
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Table 4.2 Calculation of points on the concordia curve 


The Pb/U isotopic ratios listed below define the concordia diagram for the last 5 Ga. They were computed with A 238 = 1.55 1 25 
X 10 _10 yr _1 and A 235 = 9.8485 X 10 _10 yr _1 as decay constants in the following formulas: 


206 Pb _ x 238 / , 
238y _ e 1 

207 Pb _ \ t j 

235JJ e 235 1 




Age [Ga] 

206 Pb/ 238 U 

207p b/ 235 U 

Age [Ga] 

206p b/ 238 U 

207 Pb/ 235 U 

0.1 

0.0156 

0.1035 

2.6 

0.4968 

11.9437 

0.2 

0.0315 

0.2177 

2.7 

0.5202 

13.2834 

0.3 

0.0476 

0.3437 

2.8 

0.5440 

14.7617 

0.4 

0.0640 

0.4828 

2.9 

0.5681 

16.3930 

0.5 

0.0806 

0.6363 

3.0 

0.5926 

18.1931 

0.6 

0.0975 

0.8056 

3.1 

0.6175 

20.1795 

0.7 

0.1147 

0.9925 

3.2 

0.6428 

22.3716 

0.8 

0.1321 

1.1987 

3.3 

0.6685 

24.7905 

0.9 

0.1498 

1.4263 

3.4 

0.6946 

27.4597 

1.0 

0.1678 

1.6774 

3.5 

0.7211 

30.4052 

1.1 

0.1861 

1.9545 

3.6 

0.7480 

33.6556 

1.2 

0.2046 

2.2603 

3.7 

0.7753 

37.2424 

1.3 

0.2234 

2.5977 

3.8 

0.8030 

41.2004 

1.4 

0.2426 

2.9701 

3.9 

0.8312 

45.5681 

1.5 

0.2620 

3.3810 

4.0 

0.8599 

50.3878 

1.6 

0.2817 

3.8344 

4.1 

0.8889 

55.7063 

1.7 

0.3018 

4.3348 

4.2 

0.9185 

61.5752 

1.8 

0.3221 

4.8869 

4.3 

0.9485 

68.0517 

1.9 

0.3428 

5.4962 

4.4 

0.9789 

75.1984 

2.0 

0.3638 

6.1685 

4.5 

1.0099 

83.0847 

2.1 

0.3851 

6.9105 

4.6 

1.0413 

91.7873 

2.2 

0.4067 

7.7292 

4.7 

1.0732 

101.3906 

2.3 

0.4287 

8.6326 

4.8 

1.1056 

111.9878 

2.4 

0.4511 

9.6296 

4.9 

1.1385 

123.6818 

2.5 

0.4738 

10.7297 

5.0 

1.1719 

136.5861 



Fig. 4.10 Hypothetical example of a U-Pb concordia-discordia 
diagram. Lead loss gives points B, C and D on a discordia line. It 
intersects the concordia curve at A, the age of the rock, and at E, the 
time of the event that caused lead loss. Marks on the concordia curve 
indicate age in Ga. 


The amounts of the daughter lead isotopes accumu¬ 
late at different rates. Lead is a volatile element and is 
easily lost from minerals, but this does not alter the iso¬ 
topic ratio of the lead that remains. Loss of lead will 
cause a point to deviate from the concordia line. However, 
because the isotopic ratio remains constant the deviant 
point lies on a straight line between the original age and 
the time of the lead-loss. This line corresponds to points 
that do not agree with the concordia curve; it is called the 
discordia line (Fig. 4.10). Different mineral grains in the 
same rock experience different amounts of lead loss, so 
the isotopic ratios in these grains give different points B, 
C and D on the discordia. The intersection of the concor¬ 
dia and discordia lines at A gives the original age of the 
rock, or in the case that it has lost all of its original lead, 
the age of this event. The intersection of the lines at E 
gives the age of the event that caused the lead loss. 

The U-Pb method has been used to date some of the 
oldest rocks on Earth. The Duffer formation in the 
Pilbara Supergroup in Western Australia contains Early 
Precambrian greenstones. U-Pb isotopic ratios for 
zircon grains separated from a volcanic rock in the 
Duffer formation define a discordia line that intercepts 
the theoretical concordia curve at 3.45 ± 0.02 Ga 
(Fig. 4.11). 
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Fig. 4.11 U-Pb concordia-discordia diagram for zircon grains from a 
volcanic rock in the Precambrian Duffer formation in the Pilbara Block, 
Western Australia (after Pidgeon, 1978). 


4.1.4.7 Lead-lead isochrons 

It is possible to construct isochron diagrams for the U-Pb 
decay systems. This is done (as in the Rb/Sr decay 
scheme) by expressing the radiogenic isotopes 206 Pb and 
207 Pb as ratios of 204 Pb, the non-radiogenic isotope of 
lead. The decay of 238 U to 206 Pb (or of 235 U to 207 Pb) can 
then be described as in Eq. (4.11), where the initial value 
of the daughter product is unknown. This gives the decay 
equations 


20 7 p b \ / 207p b \ 

204 Pb/ \ 204 Pb/o 



(4-21) 


(^ ( eA 2V - l) (422) 

\ 204 Pb/ \ 204 Pb/o \ 204 Pb/ ( ’ 

These equations can be combined into a single isochron 
equation: 


207pb\ 

204 Pb/ 

206 Pb\ 
204 Pb / 


( 207 Pb 
\ 204 Pb/o 

( 206 Pb \ 
\ 204 Pb/o 


/235u\(e A 235'- 1) 

\238U J ( e^238 f — 1) 


(4.23) 


The ratio of 235 U to 238 U as measured today has been 
found to have a constant value 1/137.88 in lunar and ter¬ 
restrial rocks and in meteorites. The decay constants are 
well known, so for a given age t the right side in Eq. (4.23) 
is constant. The equation then has the form 

x-x« = m < 4 - 24 > 

This is the equation of a straight line of slope m 
through the point (x 0 , y 0 ). The initial values of the lead 
isotope ratios are not known, but a plot of the isotopic 


ratio 207 Pb/ 204 Pb against the ratio 206 Pb/ 204 Pb is a straight 
line. The age of the rock cannot be found algebraically. 
Values for t must be inserted successively on the right side 
of Eq. (4.23) until the observed slope is obtained. 

4.1.5 Ages of the Earth and solar system 

An important source of information concerning the 
Earth’s age is the radiometric dating of meteorites. A 
meteor is a piece of solid matter from space that penetrates 
the Earth’s atmosphere at a hypersonic speed of typically 
10-20kms -1 . Atmospheric friction causes it to become 
incandescent. Outside the Earth’s atmosphere it is known 
as a meteoroid', any part that survives passage through the 
atmosphere and reaches the Earth’s surface is called a 
meteorite. Most meteorites are thought to originate in the 
asteroid belt between the orbits of Mars and Jupiter (see 
Section 1.1.3.2), although tracking of entry paths shows 
that before colliding with Earth they have highly elliptical 
counterclockwise orbits about the Sun (in the same sense 
as the planets). Meteorites are often named after the place 
on Earth where they are found. They can be roughly 
divided into three main classes according to their composi¬ 
tion. Iron meteorites consist of an alloy of iron and nickel; 
stony meteorites consist of silicate minerals; and 
iron-stony meteorites are a mixture of the two. The stony 
meteorites are further subdivided into chondrites and 
achondrites. Chondrites contain small spherules of high- 
temperature silicates, and constitute the largest fraction 
(more than 85%) of recovered meteorites. The achondrites 
range in composition from rocks made up essentially of 
single minerals like olivine to rocks resembling basaltic 
lava. Each category is further subdivided on the basis of 
chemical composition. All main types have been dated 
radiometrically, with most studies being done on the domi¬ 
nant chondrite fraction. There are no obvious age 
differences between the meteorites of the various groups. 
Chondrites, achondrites and iron meteorites consistently 
yield ages of around 4.45-4.50 Ga (Fig. 4.12). 

In 1969, a large carbonaceous chondritic meteorite 
entered the Earth’s atmosphere and exploded, scattering 
its fragments over a large area around Pueblito de Allende 
in Mexico. The Allende meteorite contained refractory 
inclusions enriched in uranium relative to lead. Refractory 
inclusions are small aggregates of material that solidify 
and vaporize at high temperature, and thus can preserve 
information from the early history of the meteorite, and 
hence of the solar system. The high U/Pb ratios of the 
Allende inclusions gave precise 207 Pb/ 206 Pb dates averag¬ 
ing 4.566 ± 0.002 Ga (Allegre et al., 1995), which is the 
current best estimate of the age of the solar system. 

A further bound on the age of the Earth is placed by the 
age of the Moon, which is thought to have formed during 
the accretion of the Earth. Six American manned missions 
and three Russian unmanned missions obtained samples of 
the Moon’s surface, which have been dated by several iso¬ 
topic techniques. The ages obtained differ according to the 
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Fig. 4.12 Radiometric age ranges of the oldest terrestrial and lunar rocks 
and meteorites (compiled from Dalrymple, 1991 and Halliday, 2001). 
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source areas of the rocks. The dark areas of the Moon’s 
surface, the so-called lunar seas or maria, were formed when 
enormous outpourings of basaltic lava filled up low-lying 
areas - such as the craters formed by large meteoritic 
impacts - and so created flat surfaces. The lunar volcanism 
may have persisted until 1 Ga ago. The light areas on the 
Moon’s surface are rough, extensively cratered highlands 
that reach elevations of 3000M000 m above the maria. They 
represent the top part of the lunar crust and are the oldest 
regions of the Moon. Frequent collisions with large aster¬ 
oids early in the Moon’s history produced numerous craters 
and pulverized the lunar crust, leaving impact breccia, rock 
fragments and a dust layer a few meters thick, called the 
lunar regolith. Age dates from the highland rocks range from 
about 3.5-4.5Ga, but the oldest age reported for a lunar 
rock is 4.51 ± 0.07 Ga, obtained by the Rb-Sr method. 

In contrast to the ages of meteorites and the Moon, 
no rock of comparable age has been preserved on the 
Earth. Among the oldest terrestrial rocks are the Isua 
metasediments from western Greenland. They have been 
extensively studied and yield an age of 3.77 Ga that is 
consistent with different decay schemes. Even older 
are the Acasta gneisses in the northwestern part of the 
Canadian shield, which gave a U-Pb age of 3.96 Ga. 
Terrestrial rocks from the oldest Precambrian shield 
areas in Australia, South Africa, South America and 
Antarctica give maximum ages of 3.4-3.8 Ga. 

Older ages are obtained from individual grains of 
zircon (ZrSi0 4 ), which forms as a trace mineral in granitic 
magmas. Zircon is a very durable mineral and can survive 
erosion that destroys the original rock. The zircon grains 
then become incorporated in sedimentary rocks. Zircon 
grains from sedimentary rocks in Australia have yielded 


U-Pb ages of 4.1-4.2 Ga; a fragment of a zircon grain, 
only 200 pan in diameter, has been dated by the U-Pb 
method at 4.404 ± 0.008 Ga, making it the oldest dated 
terrestrial solid. It is also possible to carry out oxygen- 
isotope analyses on these ancient zircon grains. Although 
fragmentary, the results from zircon grains are important 
for understanding conditions on the primeval Earth. The 
scanty evidence suggests a possible history for the devel¬ 
opment of the hot early Earth, as follows. 

The Sun and its accretionary disk, from which the solar 
system eventually evolved, formed 4.57 Ga ago. The accre¬ 
tion of the Earth and formation of its iron core lasted for 
about the first 100 Ma of Earth’s early history. The accre¬ 
tion of Mars, which has only about one-ninth the mass of 
Earth, would have been completed in the first 30 Ma. Later 
in Earth’s accretion, after perhaps 60 Ma, the Moon was 
formed as a result of the collision - called the “Giant 
Impact” - of a Mars-sized planetesimal with the proto- 
Earth, which was still much smaller than its present size. In 
the interval up to 4 Ga ago - called the Hadean (for hell¬ 
like) era - the early Earth was hot and covered by magma 
oceans; any crust would melt or be destroyed in the intense 
meteoritic bombardment. Water, if present, would be 
vaporized. After about 150Ma the proto-Earth might 
have cooled sufficiently so that an early granitic crust, and 
possibly liquid water, could form. Only a few zircon grains 
that survived this period remain as possible witnesses. 
Repeated bombardments may have destroyed the crust 
and vaporized the water repeatedly in the following 
400 Ma. Around 4 Ga ago the oldest surviving continental 
crust was formed. 

This scenario is speculative and it is not unique. Some 
oxygen-isotope studies on zircon grains have been inter¬ 
preted as indicating a cool early Earth. High 8 ls O values 
(see Box 5.2) measured in these zircons imply surface tem¬ 
peratures low enough for liquid water; uniform condi¬ 
tions throughout the Archean era (4.4-2.6 Ga) were 
deduced. This model of early Earth suggests that mete¬ 
oritic impacts may have been less intense than usually 
hypothesized. The model of a hot early Earth is more 
popularly accepted but the true history is not yet known. 
The events that happened in the first few hundred million 
years of its history are among Earth's best-kept secrets. 


4.2 THE EARTH'S HEAT 

4.2.1 introduction 

The radiant energy from the Sun, in conjunction with 
gravitational energy, determines almost all natural 
processes that occur at or above the Earth's surface. The 
hot incandescent Sun emits radiation in a very wide range 
of wavelengths. The radiation incident on the Earth is 
largely reflected into space, part enters the atmosphere 
and is reflected by the clouds or is absorbed and re¬ 
radiated into space. A very small part reaches the surface, 
where it is also partly reflected, especially from the water 
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surfaces that cover three-quarters of the globe. Some is 
absorbed (e.g., by vegetation) and serves as the source of 
power for various natural cycles. A small fraction is used 
to heat up the Earth's surface, but it only penetrates a 
short distance, some tens of centimeters in the case of the 
daily cycle and a few tens of meters for the annual 
changes. As a result, solar energy has negligible influence 
on internal terrestrial processes. Systems as diverse as the 
generation of the geomagnetic field and the motion of 
global lithospheric plates are ultimately powered by the 
Earth’s internal heat. 

The Earth is constantly losing heat from its interior. 
Although diminutive compared to solar energy, the loss 
of internal heat is many times larger than the energy lost 
by other means, such as the change in Earth’s rotation 
and the energy released in earthquakes (Table 4.3). Tidal 
friction slows down the Earth’s rotation, and the change 
can be monitored accurately with modern technology 
such as very long baseline interferometry (VLBI) and the 
satellite-based geodetic positioning system (GPS). The 
associated loss of rotational energy can be computed 
accurately. The elastic energy released in an earthquake 
can be estimated reliably, and it is known that most of 
the energy is released in a few large shocks. However, the 
annual number of large earthquakes is very variable. The 
number with magnitude M>1 varies between about 10 
and 40 (see Fig. 3.49), giving estimates of the annual 
energy release from about 5X10 17 J to 4X10 19 J. The 
energies of tidal deceleration and earthquakes are small 
fractions of the geothermal flux, which is the most impor¬ 
tant form of energy originating in the body of the Earth. 

The Earth's internal heat derives from several sources 
(Section 4.2.5). For the past 4 Ga or so the Earth's heat has 
been obtained from two main sources. One is the cooling 
of the Earth since its early history, when internal tempera¬ 
tures were much higher than they now are. The other is the 
heat produced by the decay of long-lived radioactive iso¬ 
topes. This is the main source of the Earth’s internal heat, 
which, in turn, powers all geodynamic processes. 

4.2.2 Thermodynamic principles 

In order to describe thermal energy it is necessary to 
define clearly some important thermodynamic parame¬ 
ters. The concepts of temperature and heat are easily - 
and frequently - confused. Temperature - one of the 
seven fundamental standard parameters of physics - is a 
quantitative measure of the degree of hotness or coldness 
of an object relative to some standard. Heat is a form of 
energy which an object possesses by virtue of its tempera¬ 
ture. The difference between temperature and heat is illus¬ 
trated by a simple example. Imagine a container in which 
the molecules of a gas move around at a certain speed. 
Each molecule has a kinetic energy proportional to the 
square of its velocity. There may be differences from one 
molecule to the next but it is possible to determine the 
mean kinetic energy of a molecule. This quantity is pro- 


Table 4.3 Estimates of notable contributions to the 
Earth’s annual energy budget 


Energy source 

Annual 
energy [J] 

Normalized 
[geothermal 
flux = f] 

Reflection and re-radiation 
of solar energy 

5.4 X10 24 

= 4000 

Geothermal flux from 

Earth's interior 

1.4 X10 21 

1 

Rotational deceleration by 
tidal friction 

= 10“° 

= 0.1 

Elastic energy in earthquakes 

= 10 19 

= 0.01 


portional to the temperature of the gas. If we add up the 
kinetic energies of all molecules in the container we 
obtain the amount of heat it contains. If heat is added to 
the container from an external source, the gas molecules 
speed up, their mean kinetic energy increases and the tem¬ 
perature of the gas rises. 

The change of temperature of a gas is accompanied by 
changes of pressure and volume. If a solid or liquid is 
heated, the pressure remains constant but the volume 
increases. Thermal expansion of a suitable solid or liquid 
forms the principle of the thermometer for measuring 
temperature. Although Galileo reputedly invented an 
early and inaccurate “thermoscope,” the first accurate 
thermometers - and corresponding temperature scales - 
were developed in the early eighteenth century by Gabriel 
Fahrenheit (1686-1736), Ferchaut de Reaumur (1683— 
1757) and Anders Celsius (1701-1744). Their instruments 
utilized the thermal expansion of liquids and were cali¬ 
brated at fixed points such as the melting point of ice and 
the boiling point of water. The Celsius scale is the most 
commonly used for general purposes, and it is closely 
related to the scientific temperature scale. 

Temperature apparently has no upper limit. For 
example, the temperature of the surface of the Sun is less 
than 10,000 K but the temperature at its center is around 
10,000,000 K and temperatures greater than 100,000,000 K 
have been achieved in physics experiments. But as heat is 
removed from an object it becomes more and more difficult 
to lower its temperature further. The limiting low tempera¬ 
ture is often called “absolute zero” and is taken as the zero 
of the Kelvin temperature scale, named in honor of Lord 
Kelvin. Its divisions are the same as the Celsius scale and 
the temperature unit is called a kelvin. The scale is defined 
so that the triple point of water - where the solid, liquid 
and gaseous phases of water can coexist in equilibrium - is 
equal to 273.16 kelvins, written 273.16 K. 

Heat was imagined by early investigators to be 
exchanged between bodies by the flow of a mystic fluid, 
called caloric. However, in the mid nineteenth century 
James Joule, an English brewer, demonstrated in a series 
of careful experiments that mechanical energy could be 
converted into heat. In his famous experiment, falling 
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weights drove a paddle wheel in a container of water, 
raising its temperature. The increase was tiny, less than 
0.3 K, yet Joule was able to compute the amount of 
energy needed to raise the temperature by 1 K. His esti¬ 
mate of this energy - called the mechanical equivalent of 
heat - was within 5% of our modern value. The unit of 
energy is called the joule in recognition of his pioneering 
efforts. Originally, however, the unit of heat energy was 
defined as the amount needed to raise the temperature of 
one gram of water from 14.5 °C to 15.5 °C. This unit, the 
calorie (cal), is equivalent to 4.1868 J. 

In physics and engineering it is often important to 
know the change of heat energy in a unit of time, known 
as the power. The unit of power is the watt, named after 
James Watt, the Scottish engineer who played an impor¬ 
tant role in harnessing thermal energy as a source of 
mechanical power. In geothermal problems we are usually 
concerned with the loss of heat from the Earth per unit 
area of its surface. This quantity is called the heat flux (or 
more commonly heat flow)-, it is the amount of heat that 
flows per second across a square meter of surface. The 
mean heat flow from the Earth is very small and is mea¬ 
sured in units of milliwatt per square meter (mW m -2 ). 

The addition of a quantity of heat A Q raises the tem¬ 
perature by an amount AT, which is proportional to A O. 
The larger the mass m of the body, the smaller is the tem¬ 
perature change, and a given amount of heat produces 
different temperature changes in different materials. The 
amount of heat needed to raise the temperature of 1 kg of 
a material by 1 K is called its specific heat, denoted c p for a 
process that occurs at constant pressure (and c v when it 
happens at constant volume). These observations are 
summarized in the equation 

A(? = c p m AT (4.25) 

The added heat causes a fractional change of volume 
that is proportional to the temperature change but which 
differs from one material to another. The material prop¬ 
erty is called the volume coefficient of expansion a, and is 
defined by the equation 

“ = f(<3t)p (4 ' 26) 

When thermal energy is added to a system, part is used 
to increase the internal energy of the system - i.e., the 
kinetic energy of the molecules - and part is expended as 
work, for example, by changing the volume. If the change 
in total energy A Q occurs at constant temperature T, we 
can define a new thermodynamic parameter, the entropy 
S, which changes by an amount AS equal to A 01T. Thus 
we can write 

AQ= TAS = AE/+AJT (4.27) 

where A U is the change of internal energy and A IT is the 
work done externally. A thermodynamic process in which 
heat cannot enter or leave the system is said to be adiabatic. 


The entropy of an adiabatic reaction remains constant: 
AS = 0. This is the case when a process occurs so rapidly 
that there is no time for heat transfer. An example is the 
passage of a seismic wave in which the compressions and 
rarefactions occur too rapidly for heat to be exchanged. 
The adiabatic temperature gradient in the Earth serves as 
an important reference for estimates of the actual tempera¬ 
ture gradient and for determining how heat is transferred. 

4.2.3 Temperature inside the Earth 

In contrast to the radial distributions of density, seismic 
velocity and elastic parameters, which are known with a 
good measure of reliability, our knowledge of the temper¬ 
ature inside the Earth is still imprecise. The temperature 
can only be measured in the immediate vicinity of the 
Earth's surface, in boreholes and deep mines. As early as 
1530 Georgius Agricola (the latinized name of Georg 
Bauer, a German physician and pioneer in mineralogy and 
mining) noted that conditions were warmer in deep mines. 
In fact, near-surface temperatures increase rapidly with 
depth by roughly 30 K km -1 . At this rate, linear extrapola¬ 
tion would give a temperature around 200,000 K at the 
center of the Earth. This is greater than the temperature of 
the surface of the Sun and is unrealistically high. 

The conditions of high temperature and pressure in 
the deep interior can be inferred from experiments, and 
the adiabatic and melting-point temperatures can be com¬ 
puted with reasonable assumptions. Nevertheless, the 
temperature-depth profile is poorly known and conjec¬ 
tured temperatures have ranged widely. Limits are placed 
on the actual temperature by the known physical state of 
the Earth’s interior deduced from seismology. The tem¬ 
perature in the solid inner core must be lower than the 
melting point, while the temperature of the molten outer 
core is above the melting point. Similarly the temperature 
in the solid mantle and crust are below the melting point; 
the asthenosphere has low rigidity because its tempera¬ 
ture comes close to the solidus (“softening point”). The 
relationship of the actual temperature to the melting 
point determines how different parts of the Earth’s inte¬ 
rior behave Theologically (see Section 2.8). 

The experimental approach to estimating the variation 
of temperature with depth combines knowledge obtained 
from seismology with laboratory results. The travel-times 
of seismic body waves show that changes in mineral 
structure (phase transitions) occur at certain depths (see 
Section 3.7.5). Important examples are the olivine-spinel 
transition at 400 km depth and the spinel-perovskite tran¬ 
sition at 670 km depth in the upper mantle. The conditions 
of temperature and pressure (and hence depth) at which 
these phase transitions take place can be observed in labo¬ 
ratory experiments, so that the temperatures at the transi¬ 
tion depths in the Earth can be determined. Similarly, the 
depth variation of the melting points of mantle rocks and 
the iron-nickel core can be inferred from laboratory 
observations at high pressure and temperature. Seismic 
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velocities in the Earth are now so well known that devia¬ 
tions from normal velocities can be determined by seismic 
tomography (see Section 3.7.6) and interpreted in terms of 
temperature anomalies. 


4.2.3.1 The adiabatic temperature gradient 

An alternative way of estimating temperature inside the 
Earth is by using physical equations in which the parame¬ 
ters are known from other sources. In the late nineteenth 
century James Clerk Maxwell expressed the laws of ther¬ 
modynamics in four simple equations involving entropy 
(S'), pressure (p ), temperature (T) and volume ( V ). One of 
these equations is 



(4.28) 


The left side is the adiabatic change in temperature 
with pressure, from which we obtain the adiabatic change 
in temperature with depth by substituting dp = pg dr, as 
in Section 3.7.4. Substituting from Eqs. (4.25) and (4.26) 
we get 


( T a VAT T a 
\ dp /s c p /«A T P c P 

from which we obtain 


(4.29) 


=T °n (4.30) 

\ dz /adiabatic ^ p 

The dependence of density and gravity on depth z 
are known from seismic travel-times, and the profiles 
of a and c p can be estimated from laboratory observa¬ 
tions (Fig. 4.13). For example, in the lower mantle at 
a depth of 1500km, g = 9.9ms -2 , c p = 1200Jkg -1 K -1 , 
a= 14 X 10 -6 K -1 , and T=2400K. This gives an adia¬ 
batic temperature gradient of about 0.3 K km -1 . In 
the outer core at about 3300 km depth the correspond¬ 
ing values are: g= 10.1m s -2 , c p = 700J kg -1 K -1 , 
ct=14xl0 -6 K -1 , and T=4000K and the adiabatic 
temperature gradient is about 0.8 K km -1 . 

Approximate estimates of adiabatic temperatures 
inside the Earth can also be obtained with the aid of 
the Gruneisen thermodynamic parameter, y. This is a 
dimensionless parameter, defined as 

aK 

T=p^ < 4 - 31 > 




Depth (km) Depth (km) 


Fig. 4.13 Variations with depth in the Earth of (a) specific heat at 
constant pressure, (b) volume coefficient of thermal expansion, (c) 
Gruneisen parameter, and (d) gravity (based upon data from Stacey, 
1992). 


where p is the density. Substituting K s = p dp/dp in Eq. 
(4.29) gives 


d t = T y _ T yd P 

d p K s pdp 

d T _ dp 
T y P 

T = T »{kh 


(4.33) 

(4.34) 


With this equation, and knowing the temperature T 0 and 
density p 0 at a given depth, the adiabatic temperature can 
be computed from the density profile in a region where 
the Gruneisen parameter y is known. Fortunately, y is 
fairly constant within large regions of the Earth’s interior 
(Fig. 4.13). Clearly, Eq. (4.34) cannot be applied across a 
boundary between these domains, where y is discontinu¬ 
ous. If T 0 and p 0 are known at calibration points, the adi¬ 
abatic temperature profile may be computed iteratively 
within a particular depth interval. A current estimate of 
the temperature profile in the Earth (Fig. 4.14) has steep 
gradients in the lithosphere, asthenosphere and in the D" 
layer above the core-mantle boundary. It indicates a tem¬ 
perature near 3750 K at the core-mantle boundary and a 
central temperature of about 5100 K. 


where K s is the adiabatic incompressibility or bulk 
modulus. It is defined in Section 3.2.4 and Eq. (3.17), 
which, by writing dp instead of p and d VIV for the dilata¬ 
tion 6 , becomes 

(4.32) 


4.23.2 The melting point gradient 

Another of Maxwell’s thermodynamic equations is 



, _ „dV_ dp 
dp K sV K s p 


P 


(4.35) 
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Fig. 4.14 Variations of estimated temperature and melting point with 
depth in the Earth (based upon data from Stacey, 1992). 


This equation can be applied to the effect of pressure on 
the melting point of a substance (T mp ). The heat required 
to melt a unit mass of the substance is its latent heat of 
fusion ( L ), so the change in entropy on the left-hand side of 
the equation is equal to (mLIT mp ). The volume change is 
the difference between that of the solid phase ( V $ ) and that 
of the liquid phase ( V L ), so Eq. (4.35) can be rewritten 


d7m p 
d p 


T 

—(Fc 
mL ' s 


VO 


(4.36) 


This is known to physicists as the Clausius-Clapeyron 
equation. It describes the effect of pressure on the melting 
point, and it is of interest to us because we can easily 
convert it to give the variation of melting point with 
depth, assuming that the pressure is hydrostatic so that 
d p = pg d: as previously. For a given mass m of the sub¬ 
stance we can replace the volumes V s and V L with the cor¬ 
responding densities p s and p L of the solid and liquid 
phases, respectively, so that 


1 d7 mp = g/p s \ 
T mp dz LW V 


(4.37) 


Again, to obtain the depth distribution of the melting 
point the variations of density and gravity with depth 
in the Earth are needed. At outer core pressures the densi¬ 
ties of the solid and liquid phases of iron are about 
13,000 kg m -3 and 11,000 kg m -3 , respectively, and the 
latent heat of fusion of iron is about 7 X 10 6 J kg -1 , so the 


gradient of the melting point curve in the outer core is 
about 1K km -1 , i.e., the melting point in the core 
increases more steeply with depth than the adiabatic tem¬ 
perature. The computations of the adiabatic and melting 
temperature curves depend on parameters (e.g., L, a, c ) 
that are not known with a great degree of reliability in the 
Earth so the temperature profiles (Fig. 4.14) will undoubt¬ 
edly change and become more secure as basic knowledge 
improves. 

One factor that must still be evaluated is the role of phase 
transitions in the mantle. The D" layer just above the 
core-mantle boundary evidently plays a crucial role in 
transferring heat from the core to the mantle. It constitutes 
a thermal boundary layer. Likewise the lithosphere forms a 
thermal boundary layer that conveys mantle heat to the 
Earth’s surface. It appears unlikely that the phase transition 
at 400 km constitutes a thermal boundary layer but the 
phase transition at 670 km depth may do so. In the model 
used to derive the temperature profiles in Fig. 4.14 the phase 
transitions do not act as thermal boundary layers. 
Throughout most of the mantle the temperature gradient is 
assumed to equal the adiabatic gradient, but the mantle is 
bounded at top and bottom by thermal boundary layers 
(the lithosphere and D"-layer, respectively) in which the 
temperature gradient greatly exceeds the adiabatic gradient. 

4.2.4 Heat transport in the Earth 

Heat can be transported by three processes: conduction, 
convection and radiation. Conduction and convection 
require the presence of a material; radiation can pass 
through space or a vacuum. Conduction is the most signif¬ 
icant process of heat transport in solid materials and thus 
it is very important in the crust and lithosphere. However, 
it is an inefficient form of heat transport, and when the 
molecules are free to move, as in a fluid or gas, the process 
of convection becomes more important. Although the 
mantle is solid from the standpoint of the rapid passage of 
seismic waves, the temperature is high enough for the 
mantle to act as a viscous fluid over long time intervals. 
Consequently, convection is a more important form of 
heat transfer than conduction in the mantle. Convection is 
also the most important form of heat transport in the fluid 
core, where related changes in the geomagnetic field show 
that the turnover of core fluid is rapid in geological terms. 
Radiation is the least important process of heat transport 
in the Earth. It is only significant in the hottest regions of 
the core and lower mantle. The absorption of radiant 
energy by matter increases its temperature and thereby the 
temperature gradient. Hence, thermal radiation can be 
taken into account as a modification of the ability of the 
material to transfer heat by conduction. 

4.2.4.'} Conduction 

Thermal conduction takes place by the transfer of kinetic 
energy between molecules or atoms. A true understanding 
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of the processes involved would require us to invoke 
quantum theory and the so-called “band theory of 
solids,” but a general understanding is possible without 
resorting to such measures. The electrons in an atom that 
are most loosely bound - the valence electrons - are essen¬ 
tially free of the ionic cores and can move through a mate¬ 
rial, so transferring kinetic energy. Hence they are called 
conduction electrons. Because electrons are electrically 
charged, the net movement of conduction electrons also 
causes an electrical current. Not surprisingly materials 
that are good electrical conductors (e.g., silver, copper) 
also conduct heat well. In this atomic view the conduction 
electrons move at very high speeds (—1000 km s -1 ) but in 
random directions so that there is no net energy transfer in 
any particular direction. In an electrical or temperature 
field the conduction electrons drift systematically down 
the slope of the field (i.e., in the direction of the electrical 
field or temperature gradient). The additional drift veloc¬ 
ity is very small (about 0.1mm s~ r ) but it passes kinetic 
energy through the material. This form of conduction is 
possible in liquids, gases or solids. 

An additional mechanism plays an important role in 
conduction in solids. The atoms in a solid occupy definite 
positions that form a lattice with a certain symmetry. The 
atoms are not stationary but vibrate at a frequency that is 
temperature dependent. The lattice vibrational energy is 
quantized, forming units called phonons. An increase in 
temperature at one end of a solid raises the lattice vibra¬ 
tional frequency there. Due to the coupling between atoms 
the increased vibration is eventually passed through the 
lattice as an increase in temperature. 

The relative importance of electrons and phonons in 
conducting heat differs from one solid to another. In 
metals, which contain large numbers of conduction elec¬ 
trons, thermal transport is due largely to the electrons; the 
lattice conductivity is barely measurable. In an insulator 
or poor conductor, such as the minerals of the crust and 
mantle, there are few conduction electrons and thermal 
conductivity is largely determined by lattice vibrations 
(phonons). 

The transport of heat by conduction in a solid is gov¬ 
erned by a simple equation. Consider a solid bar of 
length L and cross-sectional area A with its ends main¬ 
tained at temperatures 7) and T v respectively (Fig. 
4.15a). Assuming that heat flows only along the bar (i.e., 
there are no side losses) the net amount of heat ( A Q) that 
passes in a given time from the hot end to the cold end 
depends directly on the temperature difference (T 2 — 7)), 
the area of cross-section (A) and the time of observation 
(At), and inversely on the length of the bar (L). These 
observations can be summarized in the equation 

T — T 

Ai Q = kA 2 L * A t (4.38) 

The constant of proportionality, k, is the thermal con¬ 
ductivity, which is a property of the material of the bar. If 
the length of the bar is very small or the temperature 
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Fig. 4.15 (a) Conduction of heat 0 through a bar of length L and 
cross-sectional area A, with its ends kept at temperatures 7) and T 2 
(>7",). (b) Heat flux entering (g z ) and leaving (g z - dg z ) a short bar of 
length dz. 


change across it is uniform, the ratio (7) — T { )IL is the 
temperature gradient. We can modify the equation to 
describe the vertical flow of heat out of the Earth by sub¬ 
stituting the vertical temperature gradient, (d77dr), which 
is also called the geothermal gradient. Equation (4.38) can 
then be rearranged as follows 


id Q 
A dr 



(4.39) 


In this equation q. is the heat flux, defined as the flow of 
heat per unit area per second. The negative sign is needed 
to account for the direction of the heat flow; if tempera¬ 
ture increases in the downward direction of the r-axis, the 
flow of heat from high to low temperature is upward. The 
mean value of the undisturbed geothermal gradient near 
to the Earth’s surface is about 30 °C km -1 , with low 
values of around 10 °C km -1 in ancient crust and high 
values of around 50 °C km -1 in young active zones. 

The change of temperature within a body is described 
by the heat conduction equation, which is solved in Section 
4.2.6 for special situations that are of interest for the trans¬ 
fer of thermal energy in the Earth. Conduction is a slow, 
and less effective means of heat transport than convection. 
It is important in the rigid crust and lithosphere, where 
convection cannot take place. However, it cannot be 
neglected in the fluid core, which is metallic and therefore a 
good conductor. A significant part of the core’s heat is con¬ 
ducted out of the core along the adiabatic temperature gra¬ 
dient. The remainder, in excess of the conductive heat flow, 
is transported by convection currents. 
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42.4.2 Convection 

Suppose that a small parcel of material at a certain depth 
in the Earth is in thermal equilibrium with its surround¬ 
ings. If the parcel is displaced vertically upward without 
gaining or losing heat, it experiences a drop in pressure 
accompanied by a corresponding loss in temperature. If 
the new temperature of the parcel is the same as that of its 
surroundings at the new depth, the conditions at each 
depth are in adiabatic equilibrium. The variation of tem¬ 
perature with depth then defines the adiabatic tempera¬ 
ture curve. 

Now suppose that the real temperature increases with 
depth more rapidly than the adiabatic temperature gradi¬ 
ent. The temperature loss of the upwardly displaced parcel 
is due to the change in pressure, which will be the same as in 
the previous case. But the real temperature has dropped by 
a larger amount, so the parcel is now hotter and therefore 
less dense than its surroundings. Its buoyancy causes it to 
continue to rise until it reaches a level where it is in equilib¬ 
rium or can rise no further. Meanwhile, the volume vacated 
by the displaced parcel is occupied by adjacent material. 
Conversely, if a parcel of material is displaced downward, 
it experiences adiabatic increases in pressure and tempera¬ 
ture. The temperature increase is less than required by the 
real temperature gradient, so the parcel remains cooler 
than its surroundings and sinks further. A pattern of cycli¬ 
cal behavior arises in which material is heated up and rises, 
while cooler material sinks to take its place, and is in turn 
heated up and rises, and so on. The process is called thermal 
convection and the physical transportation of material and 
heat is called a convection current. 

The difference between the real and adiabatic tempera¬ 
ture gradients is the superadiabatic temperature gradient, 
9. For thermal convection to take place in a fluid, 9 must 
be positive. Suppose that the temperature at a certain 
depth exceeds the adiabatic temperature by an amount 
A T. The temperature excess causes a volume V of fluid to 
expand by an amount proportional to the volume 
coefficient of expansion, a; this causes a mass deficiency 
of (Vpa AT). Archimede’s principle applies, so the hot 
volume V experiences a buoyancy force given by: 

F b = VpgaAT (4.40) 

Two effects inhibit the hot volume from rising. First, 
some of the heat that would contribute to the buoyancy 
is removed by thermal conduction; the efficacy of this 
process is expressed by the thermal diffusivity k of 
the material, which depends on its density p, thermal 
conductivity k and specific heat c p (see Section 4.2.6). 
Second, as soon as the overheated volume of fluid begins 
to rise, it experiences a resisting drag due to the viscosity 17 
of the fluid. The effects combine to produce a force, pro¬ 
portional to K 17 , which opposes convection. If the volume 
V involved in the convection has a typical dimension D, 
so that V~ Z) 3 , we can define a dimensionless number Ra, 


the Rayleigh number, which is proportional to the ratio of 
the buoyancy force to the diffusive-viscous force: 


Ra = 


gpaAT 

k*n U 


(4.41) 


Initially, heat passes through the material by conduc¬ 
tion, but the diffusion takes some time. If the heat flux is 
large enough, it cannot diffuse entirely. The temperature 
rises above the adiabatic and buoyancy forces develop. 
For convection to occur, the buoyancy forces must domi¬ 
nate the resisting forces. This does not happen until the 
Rayleigh number exceeds a critical value, which is deter¬ 
mined additionally by the boundary conditions and the 
geometry of the convection. For example, the condition 
for the onset of convection in a thin horizontal fluid layer, 
heated from below and with the top and bottom surfaces 
free from stress, was shown by Lord Rayleigh in 1916 to 
depend on the value of Ra given by 

Ra = ^£ 4 (4.42) 


Here D is the layer thickness, 9 is the superadiabatic tem¬ 
perature gradient and v (equal to pp) is the kinematic vis¬ 
cosity. Convection begins in the flat layer if Ra is greater 
than 27 tt 4 /4 = 658. In cases with different boundary con¬ 
ditions, or for convection to occur in a spherical shell, the 
critical Rayleigh number is higher. However, convection 
generally originates if Ra is of the order of 10 3 and when 
Ra reaches around 10 5 heat transport is almost entirely by 
convection with little being transferred by diffusion. 

For convection to occur, the real temperature gradient 
must exceed the adiabatic gradient. However, the loss of 
heat by convection reduces the difference between the gra¬ 
dients. Accordingly, the adiabatic gradient evolves as a 
convecting fluid cools. An important effect of convection 
is to keep the temperature gradient close to the adiabatic 
gradient. This condition is realized in the Earth’s fluid 
core, where convection is the major mechanism of heat 
transport. Thermal convection is augmented by composi¬ 
tional convection related to the solidification of the inner 
core. The core fluid is made up of iron, nickel and lower- 
density elements, e.g., sulfur. Solidification of the inner 
core separates the dense iron from the lower-density ele¬ 
ments at the inner core boundary. Being less dense than 
the core fluid, the residual materials experience an 
upward buoyancy force, resulting in a cycle of composi- 
tionally driven convection. Thermal and compositional 
convection in the Earth’s core each act as a source of the 
energy needed to drive the geomagnetic field, with com¬ 
positional convection the more important type. 

Convection is the most important process of thermal 
transport in the fluid core, but it is also important in the 
mantle. The material of the Earth’s mantle is rigid to the 
short-lived passage of seismic waves but is believed to 
yield slowly over long periods of time (Section 2.8.6). 
Although the mantle viscosity is high, the time-scale of 
geological processes is so long that long-term flow can 
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take place. The flow patterns are dominated by thermal 
convection and are influenced by the presence of thermal 
boundary layers, which the flowing material cannot cross. 
However, convection is a more effective mechanism than 
conduction and it is thought to be the dominant process 
of heat transfer in the mantle (Section 4.2.9). 

A further process of heat transfer that involves bodily 
transport of matter is advection. This can be regarded as a 
form of forced convection. Instead of being conveyed by 
thermally produced buoyancy, advected heat is trans¬ 
ported in a medium that is itself driven by other forces. 
For example, in a thermal spring the flow of water is due 
to hydraulic forces and not to density differences in the 
hot water. Similarly, volcanic eruptions transport 
advected heat along with the lava flow, but this is pro¬ 
pelled by pressure differences rather than by buoyancy. 

4.2.43 Radiation 

Atoms can exist in many distinct energy states. The most 
stable is the ground state, in which the energy is lowest. 
When an atom changes from an excited state to a lower- 
energy state, it is said to undergo a transition. Energy cor¬ 
responding to the difference in energy between the states 
is emitted as an electromagnetic wave, which we call radi¬ 
ation. Quantum physics teaches that the radiant energy 
emitted consists of a discrete number of fundamental 
units, called quanta. The particular wavelength of the 
electromagnetic radiation associated with a transition is 
proportional to the energy difference between the two 
states. If several different transitions are taking place 
simultaneously, the body emits a spectrum of wave¬ 
lengths. Radio signals, heat, light, and x-rays are exam¬ 
ples of electromagnetic radiation that have different 
wavelengths. The electromagnetic wave consists of fluctu¬ 
ating electric and magnetic fields, which need no medium 
for their passage. For this reason, radiation can travel 
through space or a vacuum. In materials it may be scat¬ 
tered or absorbed, depending on its wavelength. Heat 
radiation corresponds to the infrared part of the electro¬ 
magnetic spectrum with wavelengths just longer than 
those of visible light. 

The radiation of a commonplace hot object depends 
on factors that are difficult to assess. Classical physics fails 
to explain adequately the absorption and emission of 
radiation. To provide an explanation physicists introduced 
the concept of a black body as a perfect absorber and 
emitter of radiation. At any temperature it emits a contin¬ 
uous spectrum of radiation; the frequency content of the 
spectrum does not depend on the material composition of 
the body but only on its temperature. An ideal black body 
does not exist in practice, but it can be approximated by a 
hollow container that has a small hole in its wall. When 
the container is heated, the radiation escaping through the 
hole - so-called cavity radiation - is effectively black-body 
radiation. In 1879 Josef Stefan pointed out that the loss of 
heat by radiation from a hot object is proportional to the 


fourth power of the absolute temperature. If R represents 
the radiant energy per second emitted per unit area of the 
surface of the body at temperature T, then 

R = dT 4 (4.43) 

where cr, known as Stefan’s constant or the Stefan- 
Boltzmann constant, has the value 5.6704 X 10 -8 
W m -2 K -4 . 

In 1900, Max Planck, professor of physics at the uni¬ 
versity of Berlin, proposed that an oscillator could only 
have discrete amounts of energy. This was the birth of 
quantum theory. The energy of an oscillator of fre¬ 
quency v is equal to the product liv, where the universal 
constant h (known as Planck’s constant) has the value 
6.626 X 10 -34 J s. The application of quantum princi¬ 
ples to black-body radiation provides a satisfactory 
explanation of Stefan’s law and allows Stefan’s constant 
to be expressed in terms of other fundamental physical 
constants. 

Radiation is reflected and refracted in a transparent 
medium wherever the refractive index n changes; energy is 
transferred to the medium in each of these interactions. 
The transparency of the medium is determined by the 
opacity e, which describes the degree of absorption of 
electromagnetic radiation. The opacity is wavelength 
dependent. In an ionic crystal the absorption of infrared 
radiation is large. It alters the vibrational frequency, and 
thereby influences the ability of the crystal lattice to 
transport heat by conduction. Thus the effect can be 
taken into account by increasing the conductivity by an 
extra radiative amount, k T , given by 

/c r = y (4.44) 

The r 3 -dependence in this expression suggests that 
radiation might be more important than lattice conduc¬ 
tivity in the hotter regions of the Earth. In fact other 
arguments lead to the conclusion that this is probably not 
the case in the upper mantle, because the effect of increas¬ 
ing temperature is partly offset by an increase in the 
opacity, e. The lower mantle is believed to have a high 
density of free electrons, which efficiently absorb radia¬ 
tion and raise the opacity. This may greatly reduce the 
efficacy of heat transfer by radiation in the mantle. 

4.2.5 Sources of heat in the Earth 

The interior of the Earth is losing heat via geothermal 
flux at a rate of about 4.4 X 10 13 W, which amounts to 
1.4 X 10 21 J yr _1 (see Table 4.3). The heat is brought to the 
surface in different ways. The creation of new lithosphere 
at oceanic ridges releases the largest fraction of the 
thermal energy. A similar mechanism, the spreading of 
the sea-floor, releases heat in the marginal basins behind 
island arcs. Rising plumes of magma originating deep in 
the mantle bring heat to the surface where they break 
through the oceanic or continental lithosphere at 
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Table 4.4 Estimates of radioactive heat production in selected rock types, based on heat production rates (from Rybach, 
1976,1988) and isotopic concentrations 


Concentration Heat production 

[p.p.m. by weight] [10 _11 Wkg _1 ] 


Rock type 

U 

Th 

K 

U 

Th 

K 

Total 

Granite 

4.6 

18 

33,000 

43.8 

46.1 

11.5 

101 

Alkali basalt 

0.75 

2.5 

12,000 

7.1 

6.4 

4.2 

18 

Tholeiitic basalt 

0.11 

0.4 

1,500 

1.05 

1.02 

0.52 

2.6 

Peridotite, dunite 

0.006 

0.02 

too 

0.057 

0.051 

0.035 

0.14 

Chondrites 

0.015 

0.045 

900 

0.143 

0.115 

0.313 

0.57 

Continental crust 

1.2 

4.5 

15,500 

11.4 

11.5 

5.4 

28 

Mantle 

0.025 

0.087 

70 

0.238 

0.223 

0.024 

0.49 


“hotspots,” characterized by intense localized volcanic 
activity. These important thermal fluxes are superposed 
on a background consisting of heat flowing into and 
through the lithosphere from deeper parts of the earth. 
There are two main sources of the internal heat. Part of it 
is probably due to the slow cooling of the Earth from an 
earlier hotter state; part is generated by the decay of long- 
lived radioactive isotopes. 

The early thermal history of the Earth is obscure and a 
matter of some speculation. According to the cold accre¬ 
tion model of the formation of the planets (see Section 
1.1.4), colliding bodies in a primordial cloud of dust and 
gas coalesced by self-gravitation. The gravitational col¬ 
lapse released energy that heated up the Earth. When the 
temperature reached the melting point of iron, a liquid 
core formed, incorporating also nickel and possibly sulfur 
or another light element associated with iron. The 
differentiation of a denser core and lighter mantle from 
an initially homogeneous fluid must have released further 
gravitational energy in the form of heat. The dissipation 
of Earth’s initial heat still has an important effect on 
internal temperatures. 

Energy released by short-lived radioactive isotopes may 
have contributed to the initial heating, but the short-lived 
isotopes would be consumed quite quickly. The heat gener¬ 
ated by long-lived radioactive isotopes has been an impor¬ 
tant heat source during most of Earth’s history. These 
isotopes separated into two fractions: some, associated 
with heavy elements, sank into the core; some, associated 
with lighter elements, accumulated in the crust. The present 
distribution of radiogenic sources within the differentiated 
Earth is uneven. The highest concentrations are in the rocks 
and minerals of the Earth’s crust, while the concentrations 
in mantle and core materials are low, However, continuing 
generation of heat by radioactivity in the deep interior, 
though small, may influence internal temperatures. 

4.2.5.1 Radioactive heat production 

When a radioactive isotope decays, it emits energetic parti¬ 
cles and y-rays. The two particles that are important in 


radioactive heat production are a-particles and (3-particles. 
The a-particles are equivalent to helium nuclei and are 
positively charged, while (3-particles are electrons. In order 
to be a significant source of heat a radioactive isotope must 
have a half-life comparable to the age of the Earth, the 
energy of its decay must be fully converted to heat, and the 
isotope must be sufficiently abundant. The main isotopes 
that fulfil these conditions are 238 U, 235 U, 232 Th and 40 K. 
The isotope 235 U has a shorter half-life than 238 U (see Table 
4.1) and releases more energy in its decay. In natural 
uranium the proportion of 238 U is 99.28%, that of 235 U is 
about 0.71%, and the rest is 234 U. The abundance of the 
radioactive isotope 40 K in natural potassium is only 
0.01167%, but potassium is a very common element and its 
heat production is not negligible. The amounts of heat 
generated per second by these elements (in p,W kg -1 ) are: 
natural uranium, 95.2; thorium, 25.6; and natural potas¬ 
sium, 0.00348 (Rybach, 1976,1988). The heat Q r produced 
by radioactivity in a rock that has concentrations C v , C Th 
and C K , respectively, of these elements is 

Q r = 95.2Cu + 25.6C Th + 0.00348C K (4.45) 

Rates of radioactive heat production computed with 
this equation are shown for some important rock types in 
Table 4.4. Chondritic meteorites, made up of silicate min¬ 
erals like olivine and pyroxene, are often taken as a proxy 
for the initial composition of the mantle; likewise, the 
olivine-dominated rock dunite represents the ultramafic 
rocks of the upper mantle. It is apparent that very little 
heat is produced by radioactivity in the mantle or in the 
basaltic rocks that dominate the oceanic crust and lower 
continental crust. The greatest concentration of radiogenic 
heat sources is in the granitic rocks in the upper continental 
crust. Multiplying the radioactive heat production values 
in the last column of Table 4.4 by the rock density gives the 
radiogenic heat generated in a cubic meter of the rock, A. If 
we assume that all the heat generated in a rock layer of 
thickness D meters escapes vertically, the amount crossing 
a square meter at the surface per second (i.e., the radioac¬ 
tive component of the heat flow) is DA. For example, a 
layer of granite 1 km thick contributes about 3 mW nr 2 to 
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Fig. 4.16 Dependence of surface heat flux on radioactive heat 
generation in two heat-flow provinces: (a) Sierra Nevada, (b) eastern 
United States (data source: Royefa/., 1968). 


the continental heat flow. The figures suggest that the 
10-20 km thick upper crust produces about one-half of the 
mean continental heat flow, which is 65 mW m -2 . 

In fact the relative importance of radiogenic heat in the 
crust is variable from one region to another. A region in 
which the heat flow is linearly related to the heat produced 
by radioactivity is called a heat-flow province. Some exam¬ 
ples of heat-flow provinces are Western Australia, the 
Superior Province in the Canadian Shield, and the Basin- 
and-Range Province in the western United States. As 
shown in Fig. 4.16 each province is characterized by a 
different linear relation between q and A, such that 

q = q r + DA (4.46) 


Heat-flow province 

Basin & Range 
Sierra Nevada 
Eastern USA 
Canada (Superior) 
Ukraine 
England & Wales 
Western Australia 
Central Australia 

Southeastern Appalachians 
Brazil (coastal shield) 
Baltic shield 
Indian shield (Proterozoic) 
Indian shield 


Heat flow (mW m 2 ) 
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Fig. 4.17 Mean heat flow, reduced heat flow and characteristic 
thickness of the layer of radioactive heat production in several heat- 
flow provinces (data source: Vitorello and Pollack, 1980). 

the surface to infinite depth gives the total radioactive heat 
production: 


The parameters q x and D typify the heat-flow province. 
The intercept of the straight line with the heat-flow axis, 
q , is called the reduced heat flow. This is the heat flow that 
would be observed in the province if there were no radio¬ 
genic crustal heat sources. It is due in part to the heat 
flowing from deeper regions of the Earth into the base of 
the crustal layer, and partly to cooling of the originally 
hotter upper crustal layer. Investigations in different heat- 
flow provinces show that the reduced heat flow averages 
about 55% of the mean measured heat flow in a province 
(Fig. 4.17). 

The simplest interpretation of D is to regard it as a 
characteristic thickness of crust involved in radioactive 
heat production. This assumes that the radiogenic heat 
sources are distributed uniformly in a crustal slab of con¬ 
stant thickness, which is an unlikely situation. A more 
likely model is that the radioactive heat generation 
decreases with depth. Assuming an exponential decrease, 
the heat production A(z) at depth z is related to the 
surface heat generation A 0 as 

A(z)=A 0 e~-- /D (4.47) 

where D is a characteristic depth (the depth at which A(z) 
has decreased to e -1 of its surface value). Integrating from 


J A(z)dz = A 0 J e~ z/D dz = DA 0 (4.48) 

o o 

which is the same as for the uniform distribution. The 
infinite depth limit to the exponential distribution is obvi¬ 
ously unrealistic. If the radiogenic sources are distributed 
in a layer of finite thickness s, the integration becomes 

S 

A 0 je--- /D dz = DA 0 ( 1 - e“^) (4.49) 

o 

If .? is greater than three times D, this expression differs 
from DA 0 by less than 5%. The value of D estimated from 
studies of heat-flow and radioactive heat generation aver¬ 
ages about 10 km, but varies from ~4 km to ~ 16 km (Fig. 
4.17). 

The three main sources of the Earth’s surface heat flow 
are (i) heat flowing into the base of the lithosphere from 
the deeper mantle, (ii) heat lost by cooling of the lithos¬ 
phere with time, and (iii) radiogenic heat production in 
the crust. The contributions are unequal and different in 
the oceans and continents (Table 4.5). The most obvious 
disparity is in the relative importance of lithospheric 
cooling and radioactivity. The lithosphere is hot when 
created at oceanic ridges and cools slowly as it ages. The 
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Table 4.5 Approximate relative contributions (in %) of 
the main sources of heat flow in oceanic and continental 
lithosphere (from Bott, 1982) 


Contribution to heat 
flow in: 


Heat source 

Continents 

[%] 

Oceans 

[%] 

Cooling of the lithosphere 

20 

85 

Heat flow from below the 

25 

10 

lithosphere 

Radiogenic heat: 

55 

5 

upper crust 

40 

— 

rest of lithosphere 

15 

— 


loss of heat by lithospheric cooling is most pronounced in 
the oceanic crust, which moreover contains few radi¬ 
ogenic heat sources. In contrast, the older continental 
lithosphere has lost much of the early heat of formation, 
and the higher concentration of radioactive minerals 
increases the importance of radiogenic heat production. 
Regardless of its source, the passage of heat through the 
rigid outer layers takes place predominantly by conduc¬ 
tion, although in special circumstances, such as the flow 
of magma in the crust or hydrothermal circulation near to 
oceanic ridges, convection also plays an important role. 

4.2.6 The heat conduction equation 

Jean Baptiste Joseph Fourier (1768-1830), a noted 
French mathematician and physicist, developed the 
theory of heat conduction in 1822. Here we consider the 
example of one-dimensional heat flow, which typifies 
many interesting problems involving the flow of heat in a 
single direction. The equation of heat conduction is most 
easily developed for this case, from which it can be readily 
extended to three-dimensional heat flow. 

Consider the flow of heat in the negative direction of 
the z-axis through a small rectangular prism with sides 
dx, dy and dr (see Fig. 4.15b). We will assume that there 
are no sources of heat inside the box. Let the amount of 
heat entering the prism at (z + dr) be Q,. This is equal to 
the heat flow q, multiplied by the area of the surface it 
flows across (dxdyj and by the duration of the flow (dr). 
The heat leaving the box at r is Q. — dQ z , which can be 
written Q. — (d Q_ /dr)dr. The increase in heat in the small 
box is the difference between these amounts: 

dO. dr/. h 2 t 

~^r dr = dr (dx dy )dt = kj^dVdt (4. 50) 

where dFis the volume of the box (dxdydz). Note that 
Q_, q. and T are all understood to decrease in the direc¬ 
tion of flow, so we have substituted for q. from Eq. (4.39) 
without using the negative sign. The heat increase in the 
box causes its temperature to rise by an amount (d T), 


determined by the specific heat at constant pressure (c p ) 
and the mass of material (m) in the box. Using Eq. (4.25), 
we write 


c v mdT=c v pdVdT (4.51) 

where p is the density of the material in the box. If we 
equate Eq. (4.50) and Eq. (4.51) for the amount of heat 
left in the box, we get the equation of heat conduction 


dT k d 2 T 
dr P c p dr 2 

ar = flr 

dt K dz 2 


(4.52) 


where k (= kipcf is called the thermal diffusivity; it has 
the dimensions m 2 s _1 . The equation is written with 
partial differentials because the temperature T is a func¬ 
tion of both time and position: T = T{z, t ). This just 
means that, on the one hand, the temperature at a certain 
position changes with time, and, on the other hand, the 
temperature at any given time varies with position in the 
body. 

The same arguments can be applied to the components 
of heat flow through the box in the x- and y-directions. 
We obtain the three-dimensional heat flow equation (also 
called the diffusion equation ): 


ST = l d 2 T d 2 T d 2 T\ 
dt \dx 2 dy 2 dz 2 ) 


(4.53) 


The equation may be solved for any set of boundary 
conditions using the method of separation of variables 
(Box 4.1). Two important situations involving the flow of 
heat across the Earth’s surface are the heating of Earth’s 
surface by solar energy and the cooling of hot lithos¬ 
phere. These can be handled to a first approximation as 
problems of one-dimensional heat conduction. 


42.6.1 Penetration of external heat into the Earth 

Solar energy is by far the greatest of the Earth's energy 
sources (Table 4.3). In order to determine the geothermal 
flux from the Earth's interior, it is important to under¬ 
stand the effects of solar energy that reaches the Earth’s 
surface. Rocks at Earth's surface heat up during the day 
and cool down at night. The effect is not restricted to the 
immediate surface, but affects a volume of rock near the 
surface. Similarly, the mean surface temperature varies 
throughout the year with the changing seasons. The heat 
conduction equation allows us to estimate what depths 
are affected by these cyclic temperature variations. 

Suppose that the surface temperature of the Earth 
varies cyclically with angular frequency co, so that at time 
t it is equal to T 0 cos cot, where T 0 is the peak temperature 
during a cycle. The temperature at time t and depth z is 
obtained by solving the one-dimensional heat equation 
(Box 4.2) and is given by 



4.2 THE EARTH'S HEAT 


231 


Box 4.1: Method of separation of variables 

The method of separation of variables is encountered 
in the solution of several geophysical problems. It may 
be illustrated by the case of one-dimensional heat 
flow, described by Eq. (4.52). The coordinates in this 
equation are independent, i.e., each can take any value 
regardless of the other. Assume that the temperature 
T(z, t) can be written as the product of Z(z), a func¬ 
tion of position only, and 0(t), a function of time only: 

T(z,t) = Z(z)0(t) (1) 


The actual solution for T(z, t ) usually does not have 
this final form, but once Z(z) and 6(t) are known they 
can be combined to fit the boundary conditions. On 
substituting in Eq. (4.52) we get 


dO 
d t 


d 2 Z 

- K d?° 


( 2 ) 


Note that we can use full differentials here because 0 
depends only on t and Z only on z. Dividing both sides 
by ZO gives 


Id0 = 1 d 2 Z 
0 dr K Z dz 2 


(3) 


Note that in Eq. (3) the left side depends only on t and 
the right side depends only on z, and that z and t are 
independent variables. On substituting a particular 
value for the time r the left side becomes a numerical 
constant. We have not restricted z, which varies inde¬ 
pendently. Equation (3) requires that the variable right 
side remains equal to the numerical constant for any 
value of z. Conversely, if we substitute a particular 
value for z, the right side becomes a (new) numerical 
constant and the variable left side must equal this con¬ 
stant for any t. The identity inherent in Eq. (3) implies 
that each side must be equal to the same separation 
constant. Let this constant be C. Then, 


I d0 = 
0 d t 


k±^=C 
Z dz 2 


(4) 


The value of the separation constant in a particular 
problem is determined by the boundary conditions of 
the problem. 


T(z,t) = T 0 e z/rf cos(o>t —^ (4.54) 

where d = (2 k/co) 112 is the decay depth of the temperature. At 
a depth of 5 d the amplitude is less than 1% of the surface 
value and is effectively zero. Note that d depends inversely 
on the frequency, so long-period fluctuations penetrate 
more deeply than rapid fluctuations. This is illustrated by a 


quick comparison of the decay depths for daily and annual 
temperature variations in the same ground 

“Annual / (2ir/l) r-—- 

,4 ■ 55, 

i.e., the annual variation penetrates about 19 times the 
depth of the daily variation (Fig. 4.18). Moreover, the 
temperature at any depth varies with the same frequency 
co as the surface variation, but it experiences a phase shift 
or delay, reaching its maximum value progressively later 
than the surface effect (Fig. 4.18). 

As representative values for crustal rocks we take: 
density, p = 2650 kg m -3 ; thermal conductivity, k = 
2.5Wm _1 K _1 ; and specific heat, c p = 700Jkg _1 K _1 . 
These give a thermal diffusivity, #c = 1.25X 10 _6 m 2 s _1 . 
The daily temperature variation (period 86,400 s) has co = 
7.27X 10 _5 s _1 , so its penetration depth is about 20cm. 
The daily variation has negligible effect deeper than about 
a meter. Similarly, the annual temperature variation 
accompanying seasonal changes has a penetration depth 
of 3.8 m and is negligible deeper than about 19 m. Heat- 
flow measurements made within 20 m of the surface will 
be contaminated by the daily and annual variations of 
surface temperature. This is not a problem in the deep 
oceans, where the Sun’s rays never reach the bottom, but 
it must be taken into account in continental heat-flow 
measurements. A serious effect is the role of the ice ages, 
which recur on a timescale of about 100,000 yr and have a 
penetration depth of several kilometers. The measured 
temperature gradient must be corrected appropriately. 

42.6.2 Cooling of the oceanic lithosphere 

A thermodynamic problem commonly encountered in 
geology is the sudden heating or cooling of a body. For 
example, molten lava intrudes cool host rocks as a dike or 
sill virtually instantaneously, but the heat is conducted to 
the adjacent rocks over a long period of time until it 
slowly dissipates. An important case in the context of 
plate tectonics is the cooling of the oceanic lithosphere, 
with its base at the temperature of the hot mantle, T m , 
and its top in contact with cold ocean-bottom sea-water. 

Assume that the fresh hot lithosphere is created at a 
ridge axis as a thin vertical prism with a uniform initial 
temperature equal to that of the hot mantle, T m . Sea-floor 
spreading transports the lithosphere horizontally, so, for 
a constant spreading rate, distance from the ridge axis is 
proportional to the cooling time ( t ). Except in the imme¬ 
diate vicinity of a spreading ridge the heat loss may be 
assumed to be solely in the vertical (—z) direction through 
the surface at z = 0, which is in contact with cold ocean- 
bottom sea-water at a temperature of 0 °C. Although the 
lithosphere has finite vertical extent, the temperature in 
the cooling slab may be approximated as the one¬ 
dimensional cooling of a semi-infinite half-space extend¬ 
ing to infinity in the z-direction. The error introduced by 
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Box 4.2: One-dimensional heat conduction 


The surface-temperature variation T 0 coscot can be 
expressed with the aid of complex numbers (Box 2.5) as 
the real part of T 0 e' w . Let z represent the depth of a 
point below the surface. The heat conduction equation, 
Eq. (4.52), can be separated and written as two parts 
equal to the same constant. We want to match bound¬ 
ary conditions with the surface disturbance T 0 e 1 "', so 
we write the separation constant as ico. Applying the 
method of separation of variables (Box 4.1) we get for 
the two parts of the solution 


1 dd 
9 d t 


= i&> 


1 d 2 Z 
K Z dz 2 


= 1(0 


( 1 ) 

( 2 ) 


The solution of Eq. (1) gives the time dependence of the 
temperature as 


9 = 0 o e iw ' 


(3) 


In order to find the depth dependence, we can rewrite 
Eq. (2) as 


We get two possible solutions for the depth variation: 
Z = Zj e inz = Z^V " 72 ^ 1 +i> - and 


Z = Z 0 e ~' mz = Z 0 e “ 1 + >)* 


( 8 ) 


The temperature must decrease with increasing depth z 
below the surface, so only the second solution is accept¬ 
able. Combining the solutions for 9 and Z we get 


T{z,t) = Z 0 e-(V“/ 2 K )(i+i)z 0 oe i M r 
T(z,i) = Z o 0 o e _ <V l ^ c ) z e'< a " _ (V^ 2,c ) z ) 


(9) 

( 10 ) 


The surface-temperature variation T 0 coswt was 
expressed as the real part of T 0 e !wt . Taking the real part 
of Eq. (10) we get for the temperature T(z,t) at time t 
and depth z 

T(z,t) =Z o 0 o e“^z C os(wt-_^|zj (11) 

Writing T 0 = Z 0 9 0 and d = ^(2k/co) the equation 
reduces to 


d 2 Z • 


dz 2 


-i^Z = 0 


(4) 


Writing — n 2 = i co/k, this is equivalent to the harmonic 
equation 


d 2 Z 


+ n 2 Z = 0 


dz 2 

which has the solutions 


Z = Z 0 e m: and Z = Z l e mz 


(5) 


( 6 ) 


where 


T(z,t) = T 0 e z/<7 cos [o>t - 


( 12 ) 


The parameter d is called the decay depth of the temper¬ 
ature. At this depth the amplitude of the temperature 
fluctuation is attenuated to 1 /e of its value on the 
surface. Eq. (12) can also be written in the form 

T(z,t) = T 0 e~ z/d cosco (t- t d ) ( 13 ) 

where the phase difference, or delay time, 
t = Z- 

ld cod (14) 



represents the length of time by which the temperature 

(7) 

at depth z lags behind the surface temperature. 


this simplification is small, and the simple model can be 
used to estimate the heat flow from the cooling lithos¬ 
phere with acceptable accuracy. 

The one-dimensional cooling of a semi-infinite half¬ 
space is described in Appendix B. The solution with the 
stated boundary conditions involves the error function, 
which is described in Box 4.3. The error function (erf) and 
complementary error function (erfc) of the parameter x 
are defined as 



erfc(.v) = 1 - erf(x) 


(4.56) 


The shapes of these functions are shown in Fig. 4.19; 
their values for any particular value of x are obtained 


from tables, just like other statistical or trigonometric 
functions. The complementary error function sinks 
asymptotically to zero, and is effectively zero for x> 2 . 
The temperature T at depth z and time t after the half¬ 
space starts to cool is given by 


T= T erf i 


2 a/k7 


= T e rf(rj) with 17 = 


2^Kt 


(4.57) 


where T m is the temperature of the hot mantle, k is 
the thermal diffusivity of the half-space, and the surface 
(z = 0 ) is at the temperature of the ocean floor, which is 
taken to be 0 °C. 
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Fig. 4.18 Temperature variations at various depths in a sandy soil: (a) 
daily fluctuations, (b) annual (seasonal) variations. 


The heat loss from the cooling lithosphere can be com¬ 
puted from the temperature distribution in Eq. (4.57). The 
heat flow is proportional to the temperature gradient (Eq. 
(4.39)). The heat flowing out of a semi-infinite half-space 
is thus obtained by differentiating Eq. (4.57) with respect 
to z: 


,dT = _,dV9T 
q - K dz K dz dr) 

= ~ k WtU T ^v) } 


kT m d 2 
2v/k? v/t r 


e “ 2 du 


(4.58) 


which simplifies to 


< 7 .- = 


^[rTKt 


(4.59) 


At the surface, z = 0 ,17 = 0, and exp(— 17 2 ) = 1. The surface 
heat flow at time t is 



The negative sign here indicates that the heat flows upward, 
in the direction of decreasing z. The semi-infinite half-space 
is quite a good model for the cooling of oceanic lithos¬ 
phere. The oceanic heat flow indeed varies with distance 
from an oceanic ridge as 1 /y/r, where t is the correspond¬ 
ing age of the lithosphere. Models for the cooling of 
oceanic lithosphere are discussed further in Section 4.2. 8 .3. 

4.2.7 Continental heat flow 

The computation of heat flow at a locality requires 
two measurements. The thermal conductivities of a 
representative suite of samples of the local rocks are 
measured in the laboratory. The temperature gradient is 
measured in the field at the investigation site. At conti¬ 
nental sites this is usually carried out in a borehole (Fig. 
4.20). There are several ways of determining the temper¬ 
ature in the borehole. During commercial drilling the 
temperature of the drilling fluid can be measured as it 
returns to the surface. This gives a more or less continu¬ 
ous record, but is influenced strongly by the heat gener¬ 
ated during the drilling. At times when drilling is 
interrupted, the bottom-hole temperature can be mea¬ 
sured. Both of these methods give data of possibly com¬ 
mercial interest but they are too inaccurate for heat-flow 
determination. 

In-hole measurements of temperature for heat-flow 
analyses are made by lowering a temperature-logging tool 
into the borehole and continuously logging the tempera¬ 
ture during its descent. The circulation of drilling fluids 
redistributes heat in the hole, so it is necessary to allow 
some time after drilling has ceased for the hole to return 
to thermal equilibrium with the penetrated formations. 
The temperature of the water in the hole is taken to be the 
ambient temperature of the adjacent rocks, provided 
there are no convection currents. 

The most common devices for measuring temperature 
are the platinum resistance thermometer and the thermis¬ 
tor. A thermistor is a ceramic solid-state device with an 
electrical resistance that is strongly dependent on temper¬ 
ature. Its resistance depends non-linearly on temperature, 
requiring accurate calibration, but the sensitivity of the 
device makes feasible the measurement of temperature 
differences of 0.001-0.01 K. The platinum resistance ther¬ 
mometer and the thermistor are used in two basic ways. 
In one method the sensor element constitutes an arm of a 
sensitive Wheatstone bridge, with which its resistance is 
measured directly. The other common method uses the 
thermal sensor as the resistive element in a tuned electri¬ 
cal circuit. The tuned frequency depends on the resistance 
of the sensor element, which is related in a known way to 
temperature. 
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Box 4.3: The error function 


Errors may be of two types: systematic and random. A 
systematic error results, for example, when the measuring 
device is wrongly calibrated (e.g., if times are measured 
with a clock that runs “fast” or “slow”). Random errors 
occur naturally when a value is measured a large number 
of times. The observations will be distributed randomly 
about their mean value. Usually there will be a few large 
deviations from the mean and a lot of small deviations, 
and there will be as many negative as positive deviations. 
The scatter of the results can be described by the stan¬ 
dard deviation a of the measurements. Random errors 
are described by the normal distribution, which is often 
called a “bell curve” because of its shape (Fig. B4.3a). If 
the mean of the distribution of the paramer u is 0 and the 
standard deviation of the mean is a, the normal distribu¬ 
tion is described by the probability density function 

f(u) = ^k e ~ (UAT)2/2 0 ) 

The standard normal distribution is defined so that it 
has a mean of zero and standard deviation a = 1. When 
integrated from — °° to +°°, the area under this curve is 

J/(w)dw= j== Je _ “ 2/2 dw= 1 (2) 

The error function is closely related to the standard 
normal distribution. However, only positive values are 
considered, so the graph of the defining function is similar 
to the right half of the curve in Fig. B4.3a. It is given as 

(3) 


The area under this curve from the origin at u = 0 to the 
value u = x (Fig. B4.3b) defines the error function erf(x): 


2 r 

erf(x)=-j= e 
V-nl 


dw 


(4) 


The complementary errorfunction, erfc(x), is defined as 
2 


erfc(.v) = 1 — erf(x) = 7- I 
VttJ. 


e “ 2 d« 


(5) 


The value of erf(.v) or erfc(.v) for any particular value of 
x may be obtained from tables. Some useful properties of 
the error function and complementary error function are: 


erf(°°)=-^]e 1,2 
s/ttJ 


£ (erf(x))= i e ~* 


d u = 1 


( 6 ) 

(7) 


(a) 



(b) 



Fig. B4.3 (a) The normal distribution, and (b) the error 
function. 

j erfc(x)dx = [x-erfc(x)]” - Jx^ (erfc(x) ) dx 


= — xerfc(x) — x — f=e x ~ dx 


= -j= Txe x2 dx — xerfc(x) 

VirJ. 

= ,L[- e“ v2 ]“ - xerfc(x) 
yrr 


a/tt 


— xerfc(x) 


l erfc( ' v)di= i 


( 8 ) 


(9) 


Some values of erffx) are listed in the following table. 


X 

erf(x) 

X 

erffx) 

X 

erf(x) 

X 

erf(x) 

0.05 

0.05637 

0.3 

0.32863 

0.6 

0.60386 

1.2 

0.91031 

0.1 

0.11246 

0.35 

0.37938 

0.7 

0.67780 

1.4 

0.95229 

0.15 

0.16800 

0.4 

0.42839 

0.8 

0.74210 

1.6 

0.97635 

0.2 

0.22270 

0.45 

0.47548 

0.9 

0.79691 

1.8 

0.98909 

0.25 

0.27633 

0.5 

0.52050 

1.0 

0.84270 

2.0 

0.99532 
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erfc(r)). 


From the measured temperature distribution, the 
average temperature gradient is computed for a geologi¬ 
cal unit or a selected depth interval (Fig. 4.20). The gradi¬ 
ent is then multiplied by the mean thermal conductivity 
of the rocks to obtain the interval (or formation) heat 
flow. Thermal conductivity can show large variations 
even between adjacent samples, so the harmonic mean of 
at least four samples is usually used. The interval heat- 
flow values may then be averaged to obtain the mean heat 
flow for the borehole (Table 4.6). 

The computation of continental heat flow from bore¬ 
hole data requires the implementation of several correc¬ 
tions. An important assumption is that heat flow is only 
vertical. Well below the surface the isotherms (surfaces of 
constant temperature) are flat lying and the flow of heat 
(normal to the isotherms) is vertical. Flowever, the surface 
of the Earth is also presumed to be locally isothermal 
(i.e., to have constant temperature) near to the borehole. 
The near-surface isotherms adapt to the topography (Fig. 
4.21) so that the direction of heat flow is deflected and 
acquires a horizontal component, while the vertical tem¬ 
perature gradient is also modified. Consequently, heat 
flow measured in a borehole must be corrected for the 
effects of local topography. 

The need for a topographic correction was recognized 
late in the nineteenth century. Further corrections must 
be applied for long-term effects such as the penetration of 
external heat related to cyclical climatic changes, for 
example the ice ages. Erosion, sedimentation and changes 
in the thermal conductivities of surface soils are other 
long-term effects that may require compensation. 

4.2.7.1 Reconstruction of ground surface temperature 
changes from borehole temperature profiles 

The main variation of temperature with depth in the 
Earth is the geothermal gradient related to the outflow of 
heat from the Earth’s deep interior. Over limited depth 


Table 4.6 Computation of heat flow from temperature 
measurements in WSR.1, a 570 m deep borehole, and 
thermal conductivity measurements on cored samples 
(after Powell et al., 1988) 


Depth 

interval 

[m] 

Temperature 

gradient 

[mKm 1 ] 

Thermal 
conductivity 
[Wnr 1 KA 1 ] 

Interval 
heat flow 
[mW nr 2 ] 

45-105 

15.0 

3.96 

60 

105-245 

18.0 

3.43 

62 

245-320 

24.8 

2.75 

68 

320^155 

16.0 

4.18 

67 

455-515 

17.2 

4.20 

72 

515-575 

16.5 

3.86 

64 

Mean heat flow = 65 mW m 2 



intervals the temperature profile is linear, its slope 
varying by region with local conditions. Flowever, 
changes in temperature at Earth’s surface affect the sub¬ 
surface temperature distribution close to the surface 
(Section 4.2.6.1). Rapid temperature changes have 
shallow penetration, but slow changes can modify sub¬ 
surface temperatures well below the surface. For 
example, daily variations do not reach below about a 
meter, but temperature variations over a century may 
extend to about 200 m depth. 

The temperature profiles in boreholes from eastern 
Canada (Fig. 4.22) show effects of surface temperature 
variations above depths of 180-250 m, below which the 
regular geothermal gradient is evident. The curved upper 
segment contains information about the history of tem¬ 
perature changes at the surface. This history can be 
retrieved by inversion of the borehole temperature data, 
which is a complex and sophisticated process. A large 
database of borehole temperature measurements, used 
for the determination of the global heat flow pattern 
(Section 4.2.8.2), has been analyzed to obtain the varia¬ 
tion of mean surface temperature since 1500 (Fig. 4.23). 
Instrumental measurements of air temperature since 
1860 agree well with the long-term surface temperatures 
estimated from the borehole data. The results show a con¬ 
sistent increase in surface temperature over the past five 
centuries. 

42.7.2 Variation of continental heat flow with age 

Many processes contribute to continental heat flow. 
Apart from the heat generated by radioactive decay, the 
most important sources are those related to tectonic 
events. During an orogenic episode various phenomena 
may introduce heat into the continental crust. Rocks 
may be deformed and metamorphosed in areas of conti¬ 
nental collision. In extensional regions the crust may be 
thinned, with intrusion of magma. Uplift and erosion 
of elevated areas and deposition in sedimentary basins 
also affect the surface heat flow. After a tectonic event 
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Fig. 4.20 Computation of 
heat flow by the interval 
method for geothermal data 
from drillhole WSR.1 on the 
Colorado Plateau of the 
western USA; horizontal bars 
show standard deviations of 
measurements in each depth 
interval (after Powell eta!., 

1988; based on data from 
Bodell and Chapman, 1982). 




Fig. 4.21 Schematic effect of surface topography on isotherms (solid 
lines) and the direction of heat flow (arrows). 


convective cooling takes place efficiently in circulating 
fluids, while some excess heat is lost by conductive 
cooling. Consequently, the variation of continental heat 
flow with time is best understood in terms of the 
tectonothermal age, which is the age of the last tectonic 
or magmatic event at a measurement site. The continen¬ 
tal heat-flow values comprise a broad spectrum. Even 
when grouped in broad age categories there is a large 
degree of overlap (Fig. 4.24). The greatest scatter is seen 
in the youngest regions. The mean heat flow decreases 
with increasing crustal or tectonothermal age (Fig. 
4.25), falling from 70-80 mW m -2 in young provinces to 
a steady-state value of 40-50 mW m -2 in Precambrian 
regions older than —800 Ma . 


4.2.73 Heat transfer through porous crustal rocks 

The transfer of heat through the continental crust takes 
place not only by conduction but also by advection. A 
sediment or rock is composed of mineral grains closely 
packed in contact with each other. The pore spaces 
between the grains can represent an appreciable fraction 
of the total volume of a rock sample. This fraction, some¬ 
times expressed as a percentage, is the porosity (j) of the 
rock. A highly porous rock, for example, may have a 
porosity of 0.3, or 30%, implying that only 70% of the 
rock is solid mineral. The porosity depends on how the 
mineral grains are arranged, how well they are cemented, 
and on their degree of sorting. Well sorted sediments have 
fairly uniform grain and pore sizes; in poorly sorted sedi¬ 
ments, there is a range of grain sizes, so the finer grains 
may “block” the voids between the larger grains, reducing 
the porosity. Igneous and metamorphic rocks often 
contain cracks and fissures, which, if sufficiently numer¬ 
ous, may give these rocks a low porosity. The degree to 
which the pore spaces are connected with each other 
determines the permeability of the rock, which is its 
ability to transmit fluids such as water and petroleum. 

Permeability is defined by an empirical relationship 
observed in 1856 by Henry Darcy, a French hydraulic 
engineer. He observed that the volume of fluid per second 
crossing a surface was proportional to the area of the 
surface (A) and the gradient of the hydraulic pressure 
head (dp/dx) driving the flow, assumed here to be in the x- 
direction, and inversely to the viscosity (/x) of the liquid. 
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Fig. 4.22 Temperature-depth 
profiles in three boreholes in 
eastern Canada. The linear 
segment in the deeper part of 
each borehole is the local 
geothermal gradient. 
Climatically induced variations 
in the ground surface 
temperature result in the 
curved segment superposed 
on the linear record in 
approximately the upper 200 
meters of each profile (after 
Pollack and Huang, 2000). 


Temperature (°C) 



Fig. 4.23 History of surface 
temperature change (with ±1 
standard error, shaded area) 
inferred for the past 500 years 
from a global database of 
borehole temperature 
measurements. The 
superposed signal since 1860 
is a 5-year running mean of 
the globally averaged 
instrumental record of surface 
air temperature (after Pollack 
and Huang, 2000). 



If the velocity of the flow is v, then in one second the 
volume crossing a surface A is (vA), so we have 


wit 

_k&P 
v A dx 


A dp 
A 1 dx 


V = 



(4.61) 


This equation, in which K=kl /jl defines the hydraulic 
conductivity, is known as Darcy’s law. Its derivation and 
form are analogous to the law of heat conduction (Eq. 
(4.38), Fig. 4.15) and Ohm’s law for electrical currents 
(Eq. (4.71), Fig. 4.40). The negative sign in the equation 
indicates that the flow is in the direction of decreasing 
pressure. The constant k is the permeability, which has 
dimensions m 2 . However, the unit of permeability used in 
hydraulic engineering is called the darcy (equivalent to 


9.87 x 10~ 13 m 2 ) or, more practically, the millidarcy (md). 
For example, an approximate range for the permeability 
of gravel is 10 5 -10 8 md, that of sandstone is 1 —100 md, 
and that of granite is 10 _3 -10 -5 md. 

The ability of fluids to flow through crustal rocks 
enables them to transmit heat. In this case the process of 
heat transfer is not by convection, because the fluid motion 
is not driven by temperature differences but by the pressure 
gradient. The heat transfer “piggybacks” on the fluid 
motion, and the process is called advection (Section 
4.2.4.2). The motion of water through the continental crust 
provides a source of geothermal energy, which can be 
tapped in several interesting ways for commercial purposes. 

4.2.8 Oceanic heat flow 

Whereas the mean altitude of the continents is only 840 m 
above sea-level, the mean depth of the oceans is 3880 m; 
the abyssal plains in large ocean basins are 5-6 km deep. 
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Fig. 4.24 Histograms of continental heat flow for four different age 
provinces (after Sclater etal., 1981). 



Tectonothermal age (Ga) 



Crustal age (Ga) 


Fig. 4.25 Continental heat-flow data averaged (a) by tectonothermal 
age, defined as the age of the last major tectonic or magmatic event 
(based on data from Vitorello and Pollack, 1980), and (b) by radiometric 
crustal age (after Sclater etal., 1980). The width of each box shows the 
age range of the data; the height represents one standard deviation on 
each side of the mean heat flow indicated by the cross at the center of 
each box. Numbers indicate the quantity of data for each box. 


At these depths extra-terrestrial heat sources have no 
effect on heat-flow measurements and the flatness of the 
ocean bottom (except near ridge systems or seamounts) 
obviates the need for topographic corrections. Measuring 
the heat flow through the ocean bottom presents technical 
difficulties that were overcome with the development of 
the Ewing piston corer. This device, intended for taking 
long cores of marine sediment from the ocean floor, 
enables in situ measurement of the temperature gradient. 
It consists of a heavily weighted, hollow sampling pipe 
(Fig. 4.26a), commonly about 10 m long although in 
special cases cores over 20 m in length have been taken 
(very long coring pipes tend to bend before they reach 
maximum penetration). A plunger inside the pipe is dis¬ 
placed by sediment during coring and makes a seal with 
the sediment surface, so that sample loss and core defor¬ 
mation are minimized when the core is withdrawn from 
the ocean floor. Thermistors are mounted on short arms a 
few centimeters from the body of the pipe, and the tem¬ 
peratures are recorded in a water-tight casement. The 
instrument is lowered from a surface ship until a free- 
dangling trigger-weight makes contact with the bottom 


(Fig. 4.26b). This releases the corer, which falls freely and 
is driven into the sediment by the one-ton lead weight. 
The friction accompanying this process generates heat, 
but the ambient temperatures in the sediments can be 
measured and recorded before the heat reaches the offset 
sensors (Fig. 4.26c). The sediment-filled corer is hauled 
back on board the ship, where the thermal conductivity of 
the sediment can be determined. The recovered core is 
used for paleontological, sedimentological, geochemical, 
magnetostratigraphic and other scientific analyses. 

Special probes have been devised explicitly for in situ 
measurement of heat flow. They consist of two parallel 
tubes about 3-10 m in length and 5 cm apart. One tube is 
about 5-10 cm in diameter and provides strength; the 
other, about 1 cm in diameter, is oil filled and contains 
arrays of thermistors. After penetration of the ocean- 
bottom sediments, as described for the Ewing corer, the 
equilibrium temperature gradient is measured. A known 
electrical current, either constant in value or pulsed, is 
then passed along a heating wire and the temperature 
response is recorded. The observations allow the thermal 
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Fig. 4.26 Method of 
measuring oceanic heat flow 
and recovering samples of 
marine sediments: (a) a coring 
device is lowered by cable to 
the sea-floor, (b) when a 
trigger-weight contacts the 
bottom, the corer falls freely, 
and (c) temperature 
measurements are made in 
the ocean floor and the 
sediment-filled corer is 
recovered to the surface ship. 




conductivity of the sediment to be found. In this way a 
complete determination of heat flow is obtained without 
having to recover the contents of the corer. 

4.2.8.1 Variation of oceanic heat flow and depth with 
lithospheric age 

The most striking feature of oceanic heat flow is the 
strong relationship between the heat flow and distance 
from the axis of an oceanic ridge. The heat flow is highest 
near to the ridge axis and decreases with increasing dis¬ 
tance from it. For a uniform sea-floor spreading rate the 
age of the oceanic crust (and lithosphere) is proportional 
to the distance from the ridge axis, and so the heat flow 
decreases with increasing age (Fig. 4.27). The lithospheric 
plate accretes at the spreading center, and as the hot mate¬ 
rial is transported away from the ridge crest it gradually 
cools. Model calculations for the temperature in the 
cooling plate are discussed in the next section: they all 
predict that the heat flow q caused by cooling of the plate 
decreases with age t as 1 Nt, when the age of the plate is 


less than about 55-70 Ma. Older lithosphere cools slightly 
less rapidly. Currently the decrease in heat flow with age is 
best explained by a global model called the Global Depth 
and Heat Flow model (GDH1). The model predicts the 
following relationships between heat flow (q, mWm -2 ) 
and age ( t , Ma): 



q = <7 S [1 + 2exp( — K-rr 2 t/a 2 )] (t>55Ma) (4.62) 

= 48 + 96exp( - 0.0278r) 

Here q s is the asymptotic heat flow, to which the heat flow 
decreases over very old oceanic crust (~ 48 mW m -2 ), a is 
the asymptotic thickness of old oceanic lithosphere 
(=95 km), and k is its thermal diffusivity («0.8xl0~ 6 
m 2 s _1 ). 

Close to a ridge axis the measured heat flow is unpre¬ 
dictable: extremely high values and very low values have 
been recorded. Over young lithosphere the observed heat 
flow is systematically less than the values predicted by 
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Fig. 4.27 Comparison of 
observed and predicted heat 
flow as a function of age of 
oceanic lithosphere, (a) 
Schematic summary for all 
oceans, showing the 
influence of hydrothermal 
heat flow at the ocean ridges 
(after Anderson and Skilbeck, 
1981). Comparisons with the 
reference cooling models PSM 
(Parsons and Sclater, 1977) 
and GDH1 (Stein and Stein, 

1992) for (b) the Pacific, (c) 
Atlantic and (d) Indian oceans. 




cooling models (Fig. 4.27). The divergence is related to 
the process of accretion of the new lithosphere. At a 
ridge crest magma erupts in a narrow zone through 
feeder dikes and/or supplies horizontal lava flows. Very 
hot material is brought in contact with sea water, which 
cools and fractures the fresh rock. The water is in turn 
heated rapidly and a hydrothermal circulation is set up, 
which transports heat out of the lithosphere by convec¬ 
tion. The eruption of hot hydrothermal currents has 
been observed directly from manned submersibles in the 
axial zones of oceanic ridges. The expeditions witnessed 
strong outpourings of mineral-rich hot water (called 
“black smokers” and “white smokers”) in the narrow 
axial rift valley. The heat output of these vents is high: 
the power associated with a single vent has been esti¬ 
mated to be about 200 MW. 

About 30% of the hydrothermal circulation takes 
place very near to the ridge axis through crust younger 
than 1 Ma. The rest is due to off-ridge circulation, which 
is possible because the fractured crust is still permeable 
to sea-water at large distances from the ridge axis. As it 
moves away from the ridge, sedimentation covers the 
basement with a progressively thicker layer of low per¬ 
meability sediments, inhibiting the convective heat loss. 
The hydrothermal circulation eventually ceases, perhaps 
because it is sealed by the thick sediment cover, but prob¬ 
ably also because the cracks and pore spaces in the crust 
become closed with increasing age. This is estimated 
to take place by about 55-70 Ma, because for greater 
ages the observed decrease in heat flow is close to that 
predicted by plate cooling models. The hydrothermal 


circulation in oceanic crust is an important part of the 
Earth's heat loss. It accounts for about a third of the 
total oceanic heat flow, and a quarter of the global heat 
flow. 

The free-air gravity anomaly over an oceanic ridge 
system is generally small and related to ocean-bottom 
topography (Section 2.6.4.3), which suggests that the 
ridge system is isostatically compensated. As hot material 
injected at the ridge crest cools, its volume contracts and 
its density increases. To maintain isostatic equilibrium a 
vertical column sinks into the supporting substratum as it 
cools. Consequently, the depth of the ocean floor (the top 
surface of the column) is expected to increase with age of 
the lithosphere. The cooling half-space model predicts an 
increase in depth proportional to a ft, where t is the age of 
the lithosphere, and this is observed up to an age of about 
80 Ma (Fig. 4.28). However, the square-root relationship 
is not the best fit to the observations. Other cooling 
models fit the observations more satisfactorily, although 
the differences from one model to another are small. 
Beyond 20 Ma the data are better fitted by an exponential 
decay. The optimum relationships between depth (d, m) 
and age (t, Ma) can be written 

d = 2600 + 365a/? (r<20Ma) 

d= d T + [1 — (8/TT 2 )exp( — kTT 2 t/a 2 )] (f>20Ma) (4 63 ) 

= 5651 -2473 exp( - 0.02780 

where d r is the mean depth of the ocean floor at ridge 
crests, d s is the asymptotic subsidence of old lithosphere 
and the other parameters are as before. 
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Fig. 4.28 Relationship between mean ocean depth and the square root 
of age for the Atlantic, Pacific and Indian oceans, compared with 
theoretical curves for different models of plate structure (after Johnson 
and Carlson, 1992). 

4.2.82 Global heat flow 

Oceanic heat flow has been measured routinely in oceano¬ 
graphic surveys since the 1950s and in situ profiles have 
been made since the 1970s. In contrast to the measure¬ 
ment of continental heat flow it is not necessary to have 
an available (and usually expensive) drillhole. However, 
the areal coverage of the oceans by heat-flow measure¬ 
ments is uneven. A large area in the North Pacific Ocean 
is still unsurveyed, and most of the oceanic areas south of 
about latitude 35°S (the approximate latitude of Cape 
Town or Buenos Aires) are unsurveyed or only sparsely 
covered. The uneven data distribution is dense along the 
tracks of research vessels and absent or meager between 
them. The sites of measured continental heat flow are 
even more irregularly distributed. Antarctica, most of the 
interiors of Africa and South America, and large 
expanses of Asia are either devoid of heat-flow data or 
are represented by only a few sites. 

In recent years a global data set of heat-flow values has 
been assembled, representing 20,201 heat-flow sites. The 
data set is almost equally divided between observations 
on land (10,337 sites) and in the oceans (9,864 sites). 
Histograms of the heat-flow values are spread over a wide 
range for each domain (Fig. 4.29). The distributions have 
similar characteristics, extending from very low, almost 
zero values to more than 200mWm“ 2 . The high values 
on the continents are from volcanic and tectonically 
active regions, while the highest values in the oceans are 
found near to the axes of oceanic ridges. Both on the con¬ 
tinents (Fig. 4.25) and in the oceans (Fig. 4.27), heat flow 
varies with crustal age. To determine global heat-flow sta¬ 
tistics, the fraction of the Earth’s surface area having a 
given age is multiplied by the mean heat flow measured 
for that age domain. The weighted sum gives a mean heat 
flow of 65mWm -2 for the continental data set. The 



Heat flow (mW m 2 ) 


Fig. 4.29 Histograms of continental, oceanic and global heat-flow 
values (after Pollack eta/., 1993). 

oceanic data must be corrected for hydrothermal circula¬ 
tion in young crust; the areally weighted mean heat flow is 
then lOlmWm^ 2 for the oceanic data set. The oceans 
cover 60.6% and the continents 39.4% of the Earth’s 
surface, the latter figure including 9.1% for the continen¬ 
tal shelves and other submerged continental crust. The 
weighted global mean heat flow is 87 mW m -2 . Multiply¬ 
ing by the Earth’s surface area, the estimated global heat 
loss is found to be 4.42 X 10 13 W (equivalent to an annual 
heat loss of 1.4 X 10 21 J). About 70% of the heat is lost 
through the oceans and 30% through the continents. 

The heat-flow values in both continental and oceanic 
domains are found to depend on crustal age and geological 
characteristics. These relationships make it possible to 
create a map of global heat flow that allows for the uneven 
distribution of actual measurements (Fig. 4.30). The pro¬ 
cedure in creating this map was as follows. First, the 
Earth’s surface was divided into 21 geological domains, of 
which 12 are in the oceans and 9 on the continents. Next, 
relationships between heat flow and age were used to asso¬ 
ciate a representative heat flow with each domain (Table 
4.7). This made it possible to estimate heat flow for regions 
that have no measurement sites. Allowance was also made 
for the loss of heat by hydrothermal circulation near to 
ridge systems. The surface of the globe was next divided 
into a grid of 1° elements (i.e., each element measures 1°X 
1°), and the mean heat flow through each element was esti¬ 
mated. This gave a complete data set (partly observed and 
partly synthesized) covering the entire globe. The gridded 
data were fitted by spherical harmonic functions (as in the 
representation of the geoid, Section 2.4.5) up to degree and 
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Table 4.7 Mean heat-flow values for the oceans and continents, based on measurements at 20,201 sites (after Pollack 
et al., 1993) 


The oceanic heat-flow values in italics are corrected for hydrothermal circulation according to the model of Stein and Stein 
(1992). 


Description 

Number of sites 

Area of Earth [%] 

Heat flow [mW m 2 ] 

Oceans 

Quaternary 

415 

1.2 

806 

Pliocene 

712 

2.4 

286 

Miocene 

1,211 

9.2 

142 

Oligocene 

593 

7.7 

93 

Eocene 

691 

7.8 

75 

Paleocene 

205 

3.9 

65 

Late Cretaceous 

359 

6.9 

60 

Middle Cretaceous 

695 

11.2 

54 

Early Cretaceous 

331 

4.3 

51 

Late Jurassic 

295 

3.8 

49 

Cenozoic undifferentiated 

846 

2.2 

89 

Mesozoic undifferentiated 

599 

0.2 

45 

All oceanic data 

6,952 

60.6 

101 
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Fig. 4.30 Global distribution 
of heat flow (mW rrr 2 ). The 
contours show a degree and 
order 12 spherical harmonic 
representation of the global 
heat flow based on direct 
measurements and empirical 
estimators for regions without 
data (after Pollack ef a/., 

1993). 
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order 12. The results of the analysis were used to compute 
smooth contours of equal heat flow, which were then 
plotted as the global heat-flow map (Fig. 4.30). If the 
global mean value is subtracted, the Earth’s surface can be 


divided into regions with above-average and below-average 
heat flow, respectively (Fig. 4.31). The regions with above- 
average heat flow are notably associated with the oceanic 
ridge systems. About half of the Earth’s heat is lost by the 




4.2 THE EARTH'S HEAT 


243 


Fig. 4.31 Geographic regions 
where the heat flow is higher 
(lighter shaded) and lower 
(unshaded) than the global 
mean heat flow; lines (darker 
shaded) mark positions of 
plate boundaries (after Pollack 
eta/., 1993). 



cooling of oceanic lithosphere of Cenozoic age (younger 
than 65 Ma). 

One must keep in mind that this global model is based 
on a mixture of actual heat-flow measurements in regions 
where they are available, and estimated values in inacces¬ 
sible regions. Moreover, the measured data near ocean 
ridges are replaced with values predicted by cooling 
models to compensate for the known loss of heat by 
hydrothermal circulation. Nevertheless, these global heat- 
flow maps are the best available representations of the 
geographical pattern and flux of the heat flowing out of 
the Earth’s interior. Although details may eventually need 
modification, the main features are not in doubt. 

4.2.83 Models for the cooling of oceanic lithosphere 

The variations of heat flow and ocean depth with time 
constrain the possible thermal models for cooling of the 
lithosphere in different ways. The predicted heat flow is 
contingent on the temperature gradient in a model, but 
the oceanic depth is defined by the vertical distribution of 
density, which, in turn, depends on the volume coefficient 
of expansion and the temperature profile in the plate. 
Thus, oceanic bathymetry depends on the temperature 
integrated over depth. 

Several cooling models have been proposed, all of 
which satisfy the decrease in heat flow and increase in 
ocean depth with age. The simplest model represents the 
cooling lithosphere as a semi-infinite half-space (Fig. 
4.32a). Initially, the temperature inside the half-space is 
uniform and higher than on its upper surface, which is 
maintained at the temperature of cold ocean-bottom sea¬ 
water. As long as the lithosphere is thin - which it is near 
the ridge - horizontal heat conduction can be neglected. 
The heat flow in the uniform half-space is vertical, along 
the z-axis, and is equivalent to the one-dimensional flow 
in a thin vertical column (Fig. 4.33a). The spreading 



Fig. 4.32 (a) Semi-infinite half-space, (b) thermal boundary layer, and 
(c) plate models for the cooling of oceanic lithosphere. T s and T m are 
surface and mantle temperature, respectively. 

process can be envisioned as transporting the column 
away from the ridge axis, during which conductive 
cooling takes place and the temperature distribution in 
the column changes. 

This model allows us to compute the temperature dis¬ 
tribution in the oceanic lithosphere. We need to compute 
the depth z at which a given temperature T is reached 
after time t, when the vertical column has moved at veloc¬ 
ity v to a distance vt from the ridge. First, the desired tem¬ 
perature T is expressed as a fraction of the mantle 
temperature T m . Using Eq. (4.57) and the appropriate 
table, the argument tj 0 is found which gives an error func¬ 
tion equal to ( TIT m ). Setting the numerical value r) 0 equal 
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(b) Age (Ma) 



Fig. 4.33 Application of the infinite half-space model to explain the 
cooling of oceanic lithosphere: (a) vertical heat flow in narrow columns 
that move away from the ridge crest, and (b) predicted thermal 
structure in the cooling plate (after Turcotte and Schubert, 1982). 

to i/2^k1 gives the shape of the isotherm for the tempera¬ 
ture T: 

Wt =Vo 

z = (2 i7 0 ^k) \[t (4.64) 

The isotherms in the cooling lithosphere have a parabolic 
shape with respect to the time (or horizontal distance) 
axis, of which only the part for z> 0 is of interest. The 
surface heat flow for this model is given by Eq. (4.60), and 
so is inversely proportional to Vf. 

The half-space model has some unrealistic aspects. It 
predicts infinitely large heat flow at the ridge axis, and the 
initial mantle temperature T m is approached asymptoti¬ 
cally and is only reached at infinite depth. The distances 
between successive isotherms for equal increments in 
temperature get progressively larger. The near-surface 
layer in which the temperature changes are significant has 
been called a thermal boundary layer. Its base is defined 
arbitrarily as the depth at which the temperature reaches 
a chosen fraction of T m . The layer can be regarded as a 
thermal model of the lithosphere (Fig. 4.32b). Instead of 
being defined mechanically as the depth where seismic 
shear waves are attenuated, the base of the lithosphere in 
the thermal model is an isotherm (Fig. 4.33b). The model 
predicts that the lithosphere becomes thicker with 
increasing age, as also inferred from seismic data, and the 


thickness is proportional to Vr. As it cools and thickens, 
the lithosphere sinks deeper into the asthenosphere, so 
that the ocean depth increases away from a ridge axis. 
Together with Pratt-type, thermally influenced isostasy 
the half-space model of lithosphere cooling predicts a 
depth increase that is also proportional to Vf (Box 4.4). 

Parker and Oldenburg (1973) proposed a modification 
of the boundary-layer model in which a solid lithosphere 
overlies a fluid asthenosphere. The base of the lithosphere 
is taken to be the solid-liquid phase boundary of the 
material. It is defined by the melting-point isotherm, and 
denotes a phase change. This is probably a closer repre¬ 
sentation of the real situation, although, by treating the 
asthenosphere as a fluid, it exaggerates the change in rhe¬ 
ology. The temperature of the asthenosphere lies close to 
the solidus temperature, but its condition is only partially 
molten (perhaps about 5%). 

The half-space and boundary-layer models fit the 
observed variations of heat flow and ocean depth with 
age for young lithosphere. For ages greater than about 
70 Ma the ocean depths in particular are less than pre¬ 
dicted by the a// relationship (Fig. 4.28). This suggests that 
the source of heat from below the lithosphere may be 
shallower than in the half-space model at large ages. As 
an alternative to the half-space models the oceanic lithos¬ 
phere has been modelled as a flat layer or plate of finite 
thickness, bounded above by cold sea-water and with a 
constant temperature on its lower surface. Far from a 
ridge axis this model brings hot mantle temperatures 
nearer to the surface than in the half-space model. Below 
the ridge axis the vertical edge of the new plate has the 
same high temperature of its lower surface (Fig. 4.32c), 
which results in heat being conducted horizontally 
through the plate. This is not a serious problem as long as 
the plate is much thinner than its horizontal extent away 
from a ridge. This condition is clearly met for the main 
lithospheric plates, which are several thousand kilometers 
across and only of the order of a hundred kilometers 
thick. 

The plate model is not intended to model the vertical 
mechanical structure of the plate, but only to explain in a 
phenomenological way the typical age dependence of 
both ocean depth and heat flow. The plate thickness in the 
model is the asymptotic thermal thickness of old oceanic 
lithosphere and reflects the combined effects of tempera¬ 
ture and rheology. Its horizontal isothermal base requires 
additional deep heat sources that prevent the lithosphere 
from cooling as a half-space at great ages. The model 
allows simple computation of the thermal cooling 
history. The best known version, proposed by Parsons 
and Sclater (1977), assumed a plate thickness of 125 km 
and a basal temperature of 1350 °C. At large distances 
from each spreading center it gives very good fits to both 
the observed heat flow (Fig. 4.27) and the ocean depth 
(Fig. 4.28). The most recent update, the GDH1 plate 
model, has a thickness of 95 km and a basal temperature 
of 1450 °C. It fits the observations even better. 




4.2 THE EARTH'S HEAT 


245 


Box 4.4: Variation of ocean depth with age 


As oceanic lithosphere moves away from the ridge it 
cools, thickens, and becomes denser (Fig. 4.31). It sinks 
progressively into the underlying asthenosphere with 
time, so that the depth of an oceanic basin increases 
with age t of the oceanic lithosphere. A simple model 
accounts for the depth change w by assuming that the 
lithosphere and asthenosphere are in Pratt-type isosta¬ 
tic balance. The isostatic model is sometimes referred to 
as thermal isostasy. 

Compare the composition of two vertical columns of 
unit cross-sectional area above a compensation level in 
the asthenosphere at depth D (Fig. B4.4). The column 
below R on the ridge axis consists of hot asthenosphere, 
of assumed constant density p a and temperature T a , and 
the depth d T of sea-water (density p w ) above the ridge. 
The column below B over the adjacent ocean basin is a 
section through oceanic lithosphere. The density p L of 
the lithosphere depends on its temperature T L and the 
lithosphere thickness L increases with age t. The sea¬ 
water layer of depth d v above the ridge is present in both 
columns, as is the thickness A of asthenosphere between 
the base of the lithosphere and the compensation depth. 
The isostatic balance is determined by equating the 
weights of w km of seawater and L km of lithosphere 
with the corresponding weight of (w + L) km of 
asthenosphere: 

(w + L)p a g = wp^g + gl p L dz (1) 

J o 

L 

W’(P a “ Pw) = [ (PL “ Pa) d - (2) 

J 0 

Thermal isostasy assumes that the lithosphere 
changes density as it cools. The volume coefficient of 
expansion a is defined by Eq. (4.26) as 


R B 



Fig. B4.4 Vertical section through oceanic lithosphere from a ridge to 
an adjacent ocean basin. 


w = , “ Pa . t (T a - r a erf(p)) dz 
(Pa“Pw)J 0 " 3 

= t— P a a - t erfc( i7)d z with 17 = ~— (6) 

(Pa-PwH 2 yfc 

The complementary error function, erfc(p), decreases 
almost to zero by 17 = 2 , so the upper limit of integration 
can be changed from L to 00 without causing significant 
error: 


«Pa r a 
(Pa Pw) 

a P,J A , 

^Pa Pw) 


erfc(T 7 )dc 


!V k) erfcfpldp 


(7) 


From Box 4.3, Eqs. ( 8 ) and (9), we have 
J“erfc(rj)d 17 = 1/a/tt. Thus, 


w = 


2 «Pa r a 
a/tt (p a “ Pw) 


a/k) 


( 8 ) 


_ J_dF_ _1 dp 
“ VdT P dT 

where density p = MIV and thus dVIV=-dplp. Re¬ 
writing Eq. (3), we get 

“Pa( r a -T L ) = (p L -p a ) (4) 

The expression for the density difference between the 
lithosphere and asthenosphere is now substituted into 
Eq. (2): 

w = , aPa T f (T ~ 72) dz (5) 

(Pa“Pw)J 0 


This is the amount by which the ocean deepens away 
from the ridge axis. The total depth of the ocean, taking 
into account the depth d r at the ridge, is d=(d r + w). 
Optimum values for the parameters of the lithosphere 
in Eq. ( 8 ) are given in the Global Depth and Heat Flow 
model (GDH1) of Stein and Stein (1992): a = 3.1 X 10 -5 
K-\p a = 3300kgn-T 3 , k = (klc p pj = 8.04 X 10- 7 m 2 s” 1 . 
Assuming a mean depth of 2600 m over the ridge axis, a 
temperature T a = 1450 C for the asthenosphere, and p w 
= 1030 kg m -3 for the density of sea-water, the depth of 
the ocean over crust of age t (in Ma) is given by 

d = (d T T w) = 2600 T SlO'sft (9) 


The temperature of the lithosphere T L is given by Eq. 
(4.57) with T a instead of T m . Substituting in Eq. (5), we 
find 


This computed result is close to the depth-age rela¬ 
tionship in Eq. (4.63) predicted by the GDH1 model for 
young lithosphere (age < 20 Ma). 
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42.8.4 Thermal structure of oceanic lithosphere 

The plate models explain observed thermal data better 
than the boundary-layer models. The boundary-layer 
model is most appropriate near to a ridge axis, and agrees 
better with other geophysical data, which show that the 
lithosphere thickens with distance from a ridge. However, 
the plate model is needed at great distances to explain heat 
flux and ocean depths over old lithosphere. To reconcile 
these contrasting attributes a two-layered model of the 
lithosphere has been proposed (Fig. 4.34). The upper layer 
is rigid and has a mechanically defined lower boundary, 
above which heat transfer is by conduction. Below this 
level the increasing temperature causes a change in 
mechanical properties. The lower lithosphere is plastic 
enough to permit material movement, and so behaves like 
a viscous solid. The base of the upper layer is an isotherm, 
representing the temperature at which rigidity is lost. The 
base of the lower lithosphere is a thermally defined 
boundary, and is also an isotherm. Several suggestions 
have been made as to how this structure may approximate 
the plate model for old lithosphere. They include addi¬ 
tional heat sources such as radiogenic heating, frictional 
heating as a result of shear at the base of the lithosphere, 
and reheating of old lithosphere due to the intrusion of 
mantle plumes at hotspots. It has also been postulated 
that, at lithosphere ages greater than about 70 Ma, small- 
scale convection currents in the lower lithosphere may 
augment thermal conduction. This would bolster the 
transfer of heat from the convecting asthenosphere into 
the lithosphere, effectively giving a thinner lithosphere 
than in the half-space models. Analysis of the dispersion 
of seismic surface waves indicates that there are differences 
in structure between continents and oceans down to about 
200 km. This is compatible with the thermal model of a 
rigid mechanical layer underlain by a convecting thermal 
boundary layer extending to about 150-200 km. 

42.8.5 Heat flow at subduction zones 

The oceanic lithosphere is bent sharply downward 
beneath the overriding plate in a subduction zone. It 
extends as an inclined slab deep into the upper mantle, 
which it penetrates at a rate of a few centimeters per year. 
The old lithosphere is cold, having lost much of its origi¬ 
nal heat of formation at the ridge axis. By the time it 
reaches an ocean trench the isotherms in the plate are far 
apart and the temperature gradient is small. The separa¬ 
tion of the isotherms is increased by the downward 
bending of the plate. Since the heat flow is proportional 
to the temperature gradient, very low heat-flow values 
(~ 35mW m -2 ) are measured in oceanic trenches. After 
bending downward the plate is subducted to great depths, 
subjecting it to increases in pressure and temperature. 
Heat is conducted into the plate from the adjacent 
mantle. This process is so slow that the interior of the sub¬ 
ducting slab remains colder than its environment (Fig. 




Fig. 4.34 Schematic diagram of lithospheric plate structure beneath 
oceans and continents. The dashed line indicates the approximation as 
a plate of constant thickness (based upon Parsons and McKenzie, 1978, 
and Sclater ef a/., 1981). 


4.35). A temperature of 800 °C is normally reached at 
about 70 km depth in the oceanic plate but in the descend¬ 
ing slab this temperature exists to deeper than 500 km. 
Above this depth the coldest part of the slab has a hori¬ 
zontal temperature deficit of 800-1000 K. 

Heat conducted from the mantle is not the only heat 
source that must be taken into account in modelling the 
thermal structure of the subducting slab. An important 
additional source is the frictional heating that results from 
shear deformation at the surfaces of the slab where it is in 
contact with the mantle. In the upper part of a subduction 
zone the shear heating melts the basaltic layer of the 
oceanic lithosphere and forms a layer of eclogite in the top 
of the slab. The high density of the eclogite causes a posi¬ 
tive gravity anomaly (see Fig. 2.62), and adds to the forces 
propelling the slab downward. The phase transition in 
which the open structure of olivine-type minerals converts 
to a denser spinel-type structure normally takes place at a 
depth of 400 km. The phase transition depends on temper¬ 
ature and pressure. Laboratory experiments indicate that it 
takes place at lower pressure at low temperature than at 
high temperature. Consequently it occurs at shallower 
depths within the cold plate than in the adjacent mantle. 
As a result the transition depth is deflected upward by 
about 100 km. The transition is exothermic and the latent 
heat given out in the transition is an additional heat source 
that contributes to the thermal structure of the subduction 
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Fig. 4.35 Bottom', the thermal structure of a subduction zone and 
back-arc region (the model of Schubert etal., 1975, inverted 
horizontally), showing the possible isotherms in the cold subducting 
plate and the thermal effects of the olivine-spinel and spinel-oxide 
phase changes. Top: comparison of heat-flow measurements across the 
Japanese trench with the theoretical heat flow (solid curve) computed 
by Toksoz ef a/. (1971). 

zone. The transition also results in a density increase, 
which adds to the forces driving the plate downward. 

The deeper transition at 670 km is less well under¬ 
stood. High temperature apparently causes it to take 
place at higher pressure, and so the depth of occurrence is 
deflected downward inside the subducting slab. It is 
uncertain whether the transition is endothermic, absorb¬ 
ing heat from the environment, or exothermic as assumed 
in the model in Fig. 4.35. An endothermic phase change 
has the effect of reducing the density, and acts against the 
other downward forces on the slab. 

Although other, slightly different models have been 
derived for the temperature distribution in the descending 
slab, they all have in common the downward deflection of 
isotherms in the cold descending slab. The heat flow can 
be computed for a given thermal model. When compared 
with the observed heat flow on a profile across the subduc¬ 
tion zone, the models fail to explain adequately the high 
heat flow observed on the overriding plate (Fig. 4.35). 
Volcanic activity is partly responsible, fed by magmas pro¬ 
duced by partial melting of oceanic crustal material in the 
descending slab and of the upper mantle in the overriding 
plate. Shallow melting is promoted by water from the sub¬ 
ducting plate and generates basaltic magma; deeper 
melting involves less water and results in andesitic magma. 
When the overriding plate is continental, volcanic chains 
form along the continental margin parallel to the deep 
oceanic trench. The volcanicity is typified by the eruption 
of both basaltic and andesitic lavas. The lavas are more 


felsic than those formed when two oceanic plates collide, 
which may imply that they include melted material from 
the upper mantle of the overriding continental plate. 

When two oceanic plates converge, a volcanic arc is 
formed on the overriding plate. Behind the arc, high heat 
flow on the overriding plate is related to back-arc spread¬ 
ing, in which new oceanic crust is generated by the intru¬ 
sion of basaltic magma from partial melting in the upper 
mantle. This form of sea-floor spreading produces a mar¬ 
ginal basin behind the island arc. The intrusion of magma 
is not confined to a single location, as at a ridge axis, but is 
spread diffusely in the basin. Consequently, the stripes of 
lineated oceanic magnetic anomalies characteristic of sea¬ 
floor spreading at ridge systems are missing or at best 
weakly defined in a marginal basin. 

4.2.9 Mantle convection 

It has gradually become accepted that thermally driven 
convection takes place in the mantle and that it is proba¬ 
bly the most important mechanism in geodynamic 
processes. There are several reasons for these conclusions. 
The evidence summarized in Section 2.8 demonstrates 
that the mantle has a viscoelastic rheology. The passage 
of seismic compressional and shear waves through the 
mantle attest that it reacts as a solid to abrupt stress 
changes. Yet, observations of post-glacial isostatic uplift 
and long-term movements of the rotation axis indicate 
that the mantle is capable of viscous flow when stressed 
over long time intervals. The surmised temperature distri¬ 
bution in the mantle implies that, although conduction is 
mainly responsible for heat transfer in the lithosphere, 
convection is the predominant process deeper in the 
mantle, involving mass transfer by sub-solidus creep. 
Applying the theory of thermal convection to the mantle 
and using the best available estimates of physical parame¬ 
ters indicates that robust convection must be taking place. 

4.2.9.1 Thermal convection 

The conditions for convection to occur (see Section 
4.2.4.2) reflect a balance between causal forces due to 
thermal expansion and resistive effects due to viscosity and 
thermal diffusivity. When a fluid is heated, thermal expan¬ 
sion gives rise to an upward buoyancy force. This produces 
instability, which is partly counteracted by diffusion of 
heat into the surrounding fluid by thermal conduction. As 
soon as a volume of the fluid starts to rise in response to 
the buoyancy force its motion is resisted by viscous forces. 
The effects are familiar to anyone who has heated a pan of 
thick soup or porridge. If the pan is heated too rapidly, or 
the instructions to “stir constantly” are ignored, the soup 
may stick to the bottom of the pan and become charred. 
This happens because the viscosity of the fluid is initially 
too large to allow convection. Despite the large tempera¬ 
ture gradient between the hot bottom of the pan and 
the cool surface of the liquid, conduction is unable to 
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Fig. 4.36 Some patterns of steady convection in a plane layer heated 
from below, (a) Convection rolls, (b) vertical flow in hexagonal patterns, 
for which the surface flow may be (c) outward away from or (d) inward 
toward the center of the cell (after Busse, 1989). 

transport heat away from the bottom of the pan fast 
enough to avoid charring. When heat flow by conduction 
reaches a critical limit, convection can begin. 

The onset of convection in a fluid layer heated from 
beneath was first described in 1900 by H. Benard on the 
basis of laboratory experiments. He noted that a hexago¬ 
nal pattern of cells forms on the surface of the layer (Fig. 
4.36). Hot fluid rises to the surface in the middle of each 
cell; at the surface it spreads out and cools. Adjoining cells 
come in contact at narrow margins, where the cooled fluid 
sinks back into the layer. Each cell has a rectangular cross- 
section in the vertical plane. A satisfactory theory of 
Benard’s observations was derived in 1916 by Lord 
Rayleigh. Although it applies to an ideal scenario (a hori¬ 
zontal layer with stress-free upper and lower boundaries, 
heated from below, and with a constant temperature on 
the upper surface) the theory permits approximate esti¬ 
mates for more complex convection in the spherical Earth. 

The flow of a viscous fluid is governed by the 
Navier-Stokes equation, one of the most important 
equations in geophysics. It describes the conservation of 
momentum in the fluid, which in its simplest form means 
balancing several terms that express the driving forces 
exerted by pressure gradient and buoyancy against the 
viscous and inertial forces that resist motion. The ratio of 
the other forces to the inertial forces is expressed by the 
dimensionless Prandtl number, Pr, defined as 

Pr = £ (4.65) 

where v is the kinematic viscosity, and k is the thermal 
diffusivity. In the mantle i/«10 18 m 2 s -1 and k« 10 -6 
m 2 s _1 , so that Pr~ 10 24 . The virtually infinite Prandtl 
number means that inertial forces are insignificant. 
Hence, mantle convection depends only on the conditions 
of pressure, temperature and viscosity. 





Thermal diffusivity and viscosity act as stabilizing 
influences in a heated fluid. If heating is slow enough, the 
temperature gradient adjusts to transfer the heat by con¬ 
duction, remaining close to the adiabatic gradient. 
Convection becomes possible when the real temperature 
gradient exceeds the adiabatic gradient; the difference 6 is 
called the superadiabatic gradient. The excess heat expands 
the fluid, causing the buoyancy force. When this becomes 
larger than the viscous resistance, convection ensues. The 
ratio of the competing forces is embodied in the Rayleigh 
number (Eq. (4.42)). The Rayleigh number (Ra^) for con¬ 
vection due to the superadiabatic temperature gradient in 
a fluid layer of thickness D is 

R a r=W Z)4 (4.66) 

where g is gravity and a is the coefficient of thermal 
expansion. 

The superadiabatic gradient is not the only source of 
power for convection. Although radioactive heat genera¬ 
tion in mantle materials is small (see Section 4.2.5.1), it 
can still contribute to convection. If Q is the radiogenic 
heat production in a layer of thickness D , we can invoke 
Eq. (4.39) and Eq. (4.45) and replace 6 in the above equa¬ 
tion by ( QDIk ), where k is the thermal conductivity. This 
allows us to define a second Rayleigh number (Ra 0 ) for 
convection driven by radiogenic heat: 

(4.67) 

Convection is initiated when the Rayleigh number 
exceeds a critical value, Ra c , which is dependent on the 
geometry of the flow and the boundary conditions on the 
upper and lower surfaces. In Rayleigh-Benard convection 
the top and bottom of the horizontal layer are stress free; 
the critical Rayleigh number is Ra c = 658. If the top and 
bottom of the layer are rigid boundaries at which the hor¬ 
izontal velocity vanishes, Ra c = 1708. Table 4.8 shows 
computed Rayleigh numbers Ra^ for convection driven 
by the superadiabatic temperature gradient for viscous 
flow in the upper, lower and whole mantle, assuming rep¬ 
resentative values from the literature for the parameters in 
Eq. (4.66). Reasonable estimates of the radiogenic heat 
produced in the mantle give even larger values for Ra 0 . 

42.9.2 Convection at high Rayleigh numbers 

The computed Rayleigh numbers greatly exceed the critical 
values for convection throughout the entire mantle or in 
separate layers. Thus, each region of the sub-lithospheric 
mantle is capable of convection. The Rayleigh number for 
whole-mantle convection is so much larger than the critical 
value Ra c that vigorous mantle convection must be 
expected. This does not imply rapid flow in normal terms. 
The speed of flow in the mantle is usually assumed to be of 
the same order as the rate of motion of tectonic plates, 
about 5-10cmyr _1 on average. As long as the flow rate v is 
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Table 4.8 Some physical parameters for mantle convection models (mostly from Jarvis and Peltier, 1989) 

The critical Rayleigh numbers (Ra c ) for the onset of convection in each part of the mantle are calculated assuming a 
superadiabatic temperature gradient 6 — 0.1 Kkm -1 and a mean gravity g = 10ms -2 . Lower mantle parameters are 
interpolated from the upper- and whole-mantle values. 



Upper mantle 

Lower mantle 

Whole mantle 

Physical parameter 

Units 

(70-670 km) 

(670-2890 km) 

(70-2890 km) 

Layer thickness (H) 

km 

600 

2220 

2820 

Expansion coefficient (a) 

K 1 

2 X 10 -5 

1.0 X 10 -5 

1.4 X 10 -5 

Density (p) 

kgm -3 

3700 

5500 

4700 

Specific heat (c p ) 

Jkg-'K -1 

1260 

1260 

1260 

Thermal conductivity (k) 

Wm - 1 K -1 

6.7 

20 

15 

Thermal diffusivity (/c) 

m 2 s -1 

1.4 X 10 -6 

3 X 10 -6 

2.5 X 10 -6 

Dynamic viscosity ( 17 ) 

kgm -1 s -1 

1 X 10 21 

2.5 X 10 21 

2 X 10 21 

Kinematic viscosity (v) 

? -1 

rrr s 1 

2.7 X 10 17 

4.5 X 10 17 

4.3 X 10 17 

Rayleigh number (Ra r ) 

— 

7000 

180,000 

820,000 


low, adjacent lamina of the fluid move past each other 
under the conditions for Newtonian viscosity (Section 
2.8.2). At faster flow rates this condition breaks down, and 
the flow becomes turbulent. The conditions favoring turbu¬ 
lence are high momentum (pr) and large scale D of the flow, 
whereas it is inhibited by high viscosity 17 . These factors are 
contained in the Reynolds number. Re, defined as 

pvD 

Re = '^ r (4.68) 

Reasonable values for the mantle are p = 5000 kg nr 3 , D = 
2900km = 2.9x 10 6 m, v = 5cmyr _1 = 1.5X 10 -9 ms -1 , 
and 77 = 1.5 X 10 21 Pa s. The Reynolds number is found to 
be Re= 1.5 X 10 _2 °, which is so small that turbulence is 
negligible. Similar results are found by considering the 
upper or lower mantle alone. Clearly, although mantle 
convection involves high Rayleigh numbers (implying vig¬ 
orous convection on a geological timescale), it takes place 
by laminar flow. 

The effect of convection is to replace conduction as the 
principal mechanism of heat transfer. A measure of the 
relative effectiveness of the two processes of heat transfer 
is the Nusselt number, Nu. This is defined as the ratio of 
the heat transport in the presence of convection to the 
heat transport without convection. In the absence of radi¬ 
ogenic heat sources, the heat transport with convection is 
determined by the Rayleigh number Ra^ while the non- 
convective heat transport is expressed by the critical 
Rayleigh number Ra c . The Nusselt number depends on 
the ratio of these two numbers and can be written 


Nu = 


P 


/Ra r \s 

VR %/ 


(4.69) 


where the coefficient p and the exponent 5 are functions 
of the aspect ratio of the convection cells. Mathematical 
evaluation of the problem of Rayleigh-Benard convec¬ 
tion with stress-free upper and lower boundaries gives p 
= 1 and 5=1/3, and, since in this case Ra c ~10 3 , the 
Nusselt number has the simpler form 


Table 4.9 Approximate aspect ratios of some mantle 
convection cells, estimatedfrom the horizontal dimensions 
of the overlying lithospheric plates (after Turcotte and 
Schubert, 1982, Table 7.5) 


Plate 

Upper-mantle 

convection 

Whole-mantle 

convection 

Pacific 

14 

3.3 

North American 

11 

2.6 

South American 

11 

2.6 

Indian 

8 

2.1 

Nazca 

6 

1.6 


Nu = 0.1(Ra r ) 1/3 (4.70) 

Using the estimated values of Ra r in Table 4.8 gives 
Nusselt numbers of 19 for layered convection in the upper 
mantle and 97 for whole-mantle convection. Hence, heat 
transfer by convection is dominant in the mantle. 

Once convection has been initiated the boundary condi¬ 
tions determine the shapes of the convection cells. In 
Rayleigh-Benard convection the aspect ratio of a cell - the 
ratio of its horizontal dimension to its vertical one - is 2 1/2 
= 1.41; when the layer has rigid boundaries the cell aspect 
ratio is 1.01. Hence, the horizontal extent of a convection 
cell is comparable with the layer thickness. This has impli¬ 
cations for convection in the mantle. If we assume that the 
scale of mantle convection is represented by the pattern of 
the plate boundaries (Fig. 1.11), we can estimate the hori¬ 
zontal dimensions of the convection cells. It is evident that 
they must have very different sizes. The ridge-to-trench 
horizontal distances across major plates are in the range 
2000-10,000 km, with an average of about 5000 km. This is 
larger than the maximum thickness of the convecting layer, 
whether we assume convection to be restricted to the upper 
mantle or to occupy the entire 2900 km thickness of the 
sub-lithospheric mantle. Thus, if convection is uniform 
through the whole mantle, the aspect ratios of at least 
some cells must be much larger than unity (Table 4.9). The 
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Fig. 4.37 Possible convection 
flow pattern ( center) and 
profiles of viscosity p. (left), 
and density p, temperature T 
and solidus temperature 0 
(right) for (a) whole-mantle 
convection and (b) layered 
mantle convection. TZ is the 
upper-mantle transition zone, 
BL are boundary layers, CMB 
is the core-mantle boundary 
(based upon Peltier etai, 
1989 ). 
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reason for this is the rigidity of the cold upper boundary 
formed by the lithosphere, which inhibits the breakup of 
the fluid flow into cells with smaller horizontal extents. 

4.2.93 Models of mantle convection 

The feasibility of mantle convection is accepted but there is 
still some doubt as to the form it takes. This is in part due to 
uncertainty as to the role played by the seismic discontinu¬ 
ities at 400 km and 670 km depth, which bound the upper- 
mantle transition zone. The discontinuities are not sharp, 
and are understood to represent mineral phase changes 
rather than compositional differences (as, for example, the 
crust-mantle and core-mantle boundaries). The upper dis¬ 
continuity marks the olivine-spinel phase change, the lower 
one represents the phase change from spinel to perovskite 
structure (Section 3.7.5.2), with accompanying changes in 
density and elastic parameters. In principle, mass can be 
carried by convection currents across these discontinuities. 
The 670 km discontinuity is close to the maximum depth of 
seismicity in subduction zones, and may be where the sub¬ 
ducting plate is absorbed into the mantle. 

There are two main models of mantle convection, each 
with an interface at the 670 km seismic discontinuity. An 
important change in viscosity occurs at this level. In whole- 
mantle convection (Fig. 4.37a) the viscosity doubles from 
the upper mantle to the lower mantle (see Table 4.8) and 
there is a net flow of material across the boundary. In this 
model, convection ensures that the entire mantle is well 
mixed mechanically, and the phase changes at 400 and 
670 km have only a small effect on the temperature gradi¬ 
ent. This model agrees with much of the available evidence. 


The alternative layered convection model has distinct 
convecting layers in the upper and lower mantle (Fig. 
4.37b). There are two ways in which this can take place. 
The upper and lower convection patterns in a vertical 
section may represent circulations in the same sense (e.g., 
both clockwise or both counterclockwise) or in opposite 
senses (e.g., one clockwise and the other counterclock¬ 
wise). In each case the radial velocity is zero at 670 km 
depth and there is no mass transfer across the discontinu¬ 
ity; the material in each flow pattern spreads out along the 
boundary. However, the models imply different types of 
coupling between the layers. Opposite senses of circulation 
in the layers would cause little or no shear between the tan¬ 
gential flows at the boundary, resulting in mechanical cou¬ 
pling between the layers. Cold material sinking in the 
upper mantle would overlie hot material rising in the lower 
mantle. However, if the layered flow patterns have the same 
sense of circulation (as in Fig. 4.37b), hot material rising in 
the upper mantle overlies hot material rising in the lower 
mantle, so that the flow regimes are coupled thermally. 
This model has a strong velocity shear across the 670 km 
discontinuity, which requires a large and abrupt change in 
viscosity at this depth; viscosity in the lower mantle would 
need to be at least two orders of magnitude smaller than in 
the upper mantle. Estimates of mantle viscosity (Section 
2.8.6) indicate the opposite: viscosity is higher in the lower 
mantle than in the upper mantle. 

A model of layered convection assumes that there is no 
mass transfer across the discontinuity. The upper and 
lower mantles are well mixed individually, but the separa¬ 
tion of the flow patterns at the discontinuity means that 
they may have distinct chemical compositions. Because 
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Fig. 4.38 An idealized cross- 
section through the mantle, 
showing convective flow and 
the relationship of mantle 
plumes to the D"-layer (after 
Stacey, 1992). 



there is no convective flow across it, heat can only cross 
the boundary by conduction. The 670 km discontinuity 
therefore acts as a thermal boundary, with a large temper¬ 
ature change of perhaps 500-1000 K across it. Thus, the 
temperature profile in the lower mantle, although main¬ 
tained adiabatic by the convection, would be 500-1000 K 
higher than in whole-mantle convection. This would 
result in a smaller temperature change across the 
core-mantle boundary, a less-steep temperature gradient 
in the D"-layer, and so a lower heat flux from the core. The 
long-term rate of cooling of the Earth would thereby be 
reduced. 

The problem of understanding mantle convection is 
complicated by the non-uniform structure and rheology of 
the mantle. As yet, there is no complete picture of how the 
various factors that influence convection act together. The 
convection pattern depends strongly on what happens 
physically and thermodynamically at the 670 km disconti¬ 
nuity. This can only be inferred indirectly. Our understand¬ 
ing of the discontinuity is incomplete, but it is essential to 
resolving the real pattern of mantle convection. 

4.2.94 Mantle plumes 

The viscosity in the upper mantle is inferred from post¬ 
glacial rebound studies to be around 10 21 Pa s, but lower- 
mantle viscosity is less well known. The sub-solidus creep 
in the mantle implies a temperature-dependent viscosity, 
which allows thermal boundary layers at the top and 
bottom of the mantle to influence the patterns of convec¬ 
tive flow. 

The lithosphere constitutes an upper, cold boundary 
layer. It accretes at high temperature at spreading ridges, 
where upwelling magma from the mantle reaches the 
surface. The eruptive lavas issue from magma chambers 
beneath ridge crests, in which magma from the deeper 
mantle undergoes differentiation. As part of the plate tec¬ 
tonic cycle the lithosphere rapidly cools and hardens as it 
spreads away from the ridge. Its high viscosity (i.e., rigid¬ 
ity) inhibits internal convection, but at subduction zones 
the plate (by now old) flexes downward and carries cold 


material into the underlying mantle, altering its thermal 
balance. Seismic tomography (Section 3.7.6) has revealed 
broad regions of raised seismic velocity in the deep 
mantle below subduction zones, giving rise to the surmise 
that the material in the cold subducted plate eventually 
sinks to the bottom of the mantle. The material must 
eventually take part in a broad-scale return flow, complet¬ 
ing the convective cycle, but how this takes place is not 
clear. 

The core-mantle boundary (CMB) at 2890 km depth 
constitutes a lower, hot boundary layer. The D"-layer at 
the base of the mantle (Section 3.7.5.3) is characterized 
by reductions in seismic velocities between about 2740 km 
depth and the CMB. It evidently has different physical 
properties than the mantle above it, and appears to play 
an important thermodynamic role. The heat flux from the 
core to the mantle diminishes the rigidity of the layer, 
thus reducing the seismic velocities. The viscosity of the 
hot thin D"-layer is presumed to be much lower than that 
of the overlying mantle. The topography of the CMB has 
been explored by seismic waves reflected from the core or 
passing it at grazing incidence. The thickness of the D" 
layer appears to be uneven, and has been interpreted by 
analogy to the crust. Thick segments have been desig¬ 
nated as crypto-continents and thinner regions as crypto¬ 
oceans (Fig. 4.38). 

The low-viscosity material in the Delayer is thought to 
supply relatively fast-flowing narrow mantle plumes. This 
name is given to vertical features, thin in cross-section, 
that facilitate the upwelling of low-viscosity hot magma 
through the more viscous mantle. A new plume melts its 
way to the surface behind a larger head. Some plumes 
may not reach the surface but intrude their material into 
the asthenosphere or lower lithosphere. Other mature 
plumes may penetrate the entire mantle and reach the 
surface, where they are evident as places of persistent vol- 
canism, high regional topography and local high heat 
flow, called hotspots. These areas of anomalous volcan- 
ism are found in the oceans and on the continents, within 
plates and on plate margins. The plumes that feed them 
are thought to remain fixed in position for long periods of 
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time, and so the hotspots are anchored to the mantle 
below the lithosphere. As a result they have important 
consequences for studies of plate tectonic motions. 


4.3 GEOELECTRICITY 

4.3.1 Introduction 


the electric current in a conductor and the voltage of the 
battery was established in 1827 by Georg Ohm, a German 
physicist. The magnetic effects produced by electric cur¬ 
rents were established in the early nineteenth century by 
Oersted, Ampere, Faraday and Lenz. Their contributions 
are discussed in more detail in a later chapter (Section 
5.1.3) on the physical origins of magnetism. 


Electric charge - together with mass, length and time - is 
a fundamental property of nature. The name electric 
derives from the Greek word for amber (“elektron”), the 
naturally occurring fossilized resin of coniferous trees 
that has been used since antiquity in the making of 
jewelry. The Greek philosopher Thales of Miletus (ca. 
600 BC) is credited with first reporting the power of 
amber, when rubbed with a cloth, to attract light objects. 
The ancient sages could not understand this behavior in 
terms of their everyday world, and so, together with the 
power of magnetism possessed by natural lodestone (see 
Section 5.1.1), electricity remained a wonderful but 
unknown phenomenon for more than two millennia. In 
1600 AD the English physician William Gilbert summa¬ 
rized previous investigations and extant knowledge in the 
first systematic study of these phenomena. 

In the following century it was established that there 
were two types of electric charge, now referred to as posi¬ 
tive and negative. Objects that carried like types of charge 
were observed to repel each other, and those that carried 
opposite types were attracted to each other. In 1752 the 
American statesman, diplomat and scientist Benjamin 
Franklin performed a celebrated experiment; by flying a 
kite during a thunderstorm, he established that lightning 
is an electrical phenomenon. Flaving survived this risky 
endeavor Franklin developed the far-sighted theory that 
electricity consisted of an omnipresent fluid, and that the 
different types of charge represented surplus and scarcity 
of this fluid. This view strikingly resembles modern 
theory, in which the “fluid” consists of electrons. 

The laws of electrostatic attraction and repulsion 
were established in 1785 as a result of careful experi¬ 
ments by a French scientist, Charles Augustin de 
Coulomb (1736-1806), who also established the laws of 
magnetostatic force (Section 5.1.3). Coulomb invented a 
sensitive torsion balance, with which he could measure 
accurately the force between electrically charged spheres. 
FTis results represent the culmination of knowledge of 
electrostatic phenomena. 

The eighteenth century concept of electricity as a fluid 
finds further expression in electrical nomenclature. 
Electricity is said to flow between charged objects when 
they are brought in contact, and the rate of flow is called 
an electric current. The study of the properties and effects 
of electric currents became possible around 1800, when an 
Italian physicist, Alessandro Volta (later elevated by 
Napoleon to the rank of Count), invented a primitive elec¬ 
tric battery, called a voltaic pile, in which electricity was 
produced by chemical action. The relationship between 


4.3.2 Electrical principles 

Coulomb established that the force of attraction or repul¬ 
sion between two charged spheres was proportional to the 
product of the individual electric charges and inversely 
proportional to the square of the distance between the 
centers of the spheres. His law can be written as the fol¬ 
lowing equation: 

F=K^P (4.71) 

where Q 1 and Q 2 are the electric charges, r is their separa¬ 
tion and K is a constant. This inverse-square law strongly 
resembles the law of universal gravitation (Eq. (2.2)), for¬ 
mulated by Newton more than a century before Coulomb’s 
law. However, in gravitation the force is always attractive, 
whereas in electricity it may be attractive or repulsive, 
depending on the nature of the charges. In the law of grav¬ 
itation the units of mass, distance and force are already 
defined, so that the gravitational constant is predeter¬ 
mined; only its numerical value needed to be measured. In 
Coulomb’s law, Fand r are defined from mechanics (as the 
newton and meter, respectively), but the units of Q and K 
are undefined. The value of K was originally set equal to 
unity, thereby defining the unit of electric charge. This def¬ 
inition led to unfortunate complications when the mag¬ 
netic effects of electric currents were analyzed. The 
alternative is to define independently the unit of charge, 
thereby fixing the meaning of the constant K. 

The unit of charge is the coulomb (C), defined as the 
amount of charge that passes a point in an electrical 
circuit when an electric current of one ampere (A) flows 
for one second (i.e., 1 C = 1A s). In turn, the ampere is 
defined from the magnetic effects of a current (see 
Section 5.2.4). When a current flows in the same direc¬ 
tion through two parallel long straight conductors, mag¬ 
netic fields are produced around the conductors, which 
cause them to attract each other. If the current flows 
through the conductors in opposite directions they repel 
each other. The ampere is defined as the current that 
produces a force of 2X10 -7 N per meter of length 
between infinitely long thin conductors that are one 
meter apart in vacuum. Thus, the unit of charge is 
defined precisely, if rather indirectly. In the Systeme 
Internationale (SI) units Fis written as (4 -tt e 0 ) _1 , so that 
Coulomb’s law becomes 


F _ i Q\Qi 

4tte 0 r 2 


(4.72) 
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where the constant e 0 is called the permittivity 
constant. It is approximately equal to 8.854 187 X 10 -12 
C 2 N" 1 m“ 2 . 

Modern electrical theory descends from the discovery 
in 1897 by the English physicist Joseph J. Thomson of the 
electron as the basic elementary unit of electric charge. It 
has a negative charge of 1.602 X 10 -19 C. A proton in the 
nucleus of an atom has an equal positive charge. 
Normally an atom contains as many electrons as it has 
protons in its nucleus and is electrically neutral. If an 
atom or molecule loses one or more electrons, it has a net 
positive charge and is called a positive ion; similarly, a neg¬ 
ative ion is an atom or molecule with a surplus of elec¬ 
trons. 

In metals, some electrons are only loosely bound to the 
atoms. They can move with relative ease through the 
material, which is called an electrical conductor. Metals 
like copper and silver are good conductors. In other mate¬ 
rials, called insulators, the electrons are tightly bound to 
the atoms. Glass, rubber, and dry wood are typical insula¬ 
tors. A perfect insulator does not allow electrons to move 
through it, whereas a perfect conductor offers no opposi¬ 
tion to the passage of electrons. Real conductors offer 
different degrees of opposition. 

A flow of charge, or electric current, results when the 
free electrons in a conductor move in a common direc¬ 
tion. A current of one ampere corresponds to a flow of 
about 6,250,000,000,000,000,000 electrons per second 
past any point of a circuit! The direction of an electric 
current is defined to be the direction of flow of positive 
charge, which is opposite to the direction of motion of 
the electrons. 


4.3.2.1 Electric field and potential 

The force exerted on a unit electric charge by another 
charge Q is called the electric field of the charge Q. Thus, 
if we let Q t = Q and g 2 = 1 in Eq. (4.72), we obtain 
the equation for the electric field E at distance r from a 
charge Q 


E = 


Q 

4Tre 0 r 2 


(4.73) 


According to this definition E has the dimensions of 
newton/coulomb (N C -1 ). 

The term “field” also has another connotation, intro¬ 
duced by Michael Faraday (1791-1867) to refer to the 
geometry of the lines of force near a charge. Around a posi¬ 
tive point charge the field lines are directed radially 
outward, describing the (divergent ) direction along which a 
free positive charge would move (Fig. 4.39a); around a neg¬ 
ative point charge they are directed radially inward (con¬ 
vergent) (Fig. 4.39b). The field lines of a pair of opposite 
point charges diverge from the positive charge, spread apart 
and converge on the negative charge (Fig. 4.39c); they give 
the appearance of drawing the opposite charges together. 




Fig. 4.39 Planar cross-sections of electric field lines around point 
charges: (a) single positive, (b) single negative, (c) two equal and 
opposite, and (d) two equal positive charges. 


The combined field of two positive point charges is charac¬ 
terized by field lines that leave each charge and diverge in 
the space between (Fig. 4.39d); the field lines appear visibly 
to push the like charges apart. The direction of the electric 
field at any point is tangential to the electric field line. The 
strength of the field is represented by the spatial concentra¬ 
tion of the field lines. Close to either electrical charge the 
field is strong and it weakens with increasing distance from 
the charge. Consequently, work is required to move a 
charged particle from one point in the field to another. This 
work contributes to the potential energy of the system. 

For example, at an infinite distance from a positive 
charge Q the repulsive force on a unit positive charge is 
zero, but at a distance r it is given by Eq. (4.73). The 
potential energy of the unit charge at r is called the elec¬ 
tric potential at r; we will denote it U. The units of U are 
energy per unit charge, i.e. joules/coulomb. If we move a 
distance dr against the field E, the potential changes by an 
amount d U equal to the work done against E, which is 
(— Edr). i.e., d U= — Edr, so that 



We can readily compute the electric potential U at r by 
integration: 
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U= — \E&r= - f ^ , dr 
j, J 4'7TE 0 ?' 2 

from which 


(4.75) 


U = 


Q 

4iTe 0 r 


(4.76) 


The energy needed to move a unit charge from one 
point to another in the electric field of Q is the potential 
difference between the two points. The unit of potential 
difference is the same as that of U (i.e., joules/coulomb) 
and is called a volt. From Eq. (4.74) we obtain the more 
common alternative units of volt/meter (Vm -1 ) for the 
electric field E. 

Electric charge flows from a point with higher poten¬ 
tial to a point with lower potential. The situation is anal¬ 
ogous to the flow of water through a pipe from one level 
to a lower level. The rate of flow of water through the 
pipe is determined by the difference in gravitational 
potential between the two levels. Likewise, the electric 
current in a circuit depends on the potential difference in 
the circuit. 


4.3.22 Ohm's law 

The German scientist Georg Simon Ohm established in 
1827 that the electric current I in a conducting wire is pro¬ 
portional to the potential difference V across it. The 
linear relationship is expressed by the equation 

V=IR (4.77) 

where R is the resistance of the conductor. The unit of 
resistance is the ohm (Cl). The inverse of resistance is 
called the conductance of a circuit; its unit is the recipro¬ 
cal ohm (ft -1 ), variously also called a mho or siemens (S). 

Experimental observations on different wires of the 
same material showed that a long wire has a larger resis¬ 
tance than a short wire, and a thin wire has a larger resis¬ 
tance than a thick wire. Formulated more precisely, for a 
given material the resistance is proportional to the length 
L and inversely proportional to the cross-sectional area A 
of the conductor (Fig. 4.40). These relationships are 
expressed in the equation 

R = PJ (4.78) 

The proportionality constant p is the resistivity of the 
conductor. It is a physical property of the material of the 
conductor, which expresses its ability to oppose a flow of 
charge. The inverse of p is called the conductivity of the 
material, denoted cr. The unit of resistivity is the ohm- 
meter (ft m); the unit of conductivity is the reciprocal 
ohm-meter (ft -1 m -1 ). 

If we substitute Eq. (4.78) for R in Eq. (4.77) and 
rearrange the terms we get the following expression: 

z-<b (4 79 » 


potential 

potential = U + dU 



. j • - t current I 

current density / = - = — 

J area A 


Fig. 4.40 Parameters used to define Ohm's law for a straight 
conductor. 

The ratio VIL on the left side of this equation is, by 
comparison with Eq. (4.74), the electric field E (assuming 
the potential gradient to be constant along the length of 
the conductor). The ratio HA is the current per unit cross- 
sectional area of the conductor; it is called the current 
density and denoted J (Fig. 4.40). We can now rewrite 
Ohm’s law as 


E = pJ (4.80) 

This form is useful for calculating the formulas used in 
resistivity methods of electrical surveying. However, the 
quantities that are measured are V and I. 

4.32.3 Types of electrical conduction 

Electric current passes through a material by one of three 
different modes: by electronic, dielectric, or electrolytic 
conduction. Electronic (or ohmic) conduction occurs in 
metals and crystals, dielectric conduction in insulators, 
and electrolytic conduction in liquids. 

Electronic conduction is typical of a metal. The free elec¬ 
trons in a metal have a high average speed (about 1.6 X 10 6 
m s -1 in copper). They collide with the atoms of the metal, 
which occupy fixed lattice sites, and bounce off in random 
directions. When an electric field is applied, the electrons 
acquire a common drift velocity, which is superposed on 
their random motions, so that they move at a much smaller 
speed (about 4X 10 -5 ms -1 in copper) in the direction of 
the field. The resistivity is determined by the mean free time 
between collisions. If the atomic arrangement causes fre¬ 
quent collisions, the resistivity is high, whereas a long mean 
free time between collisions results in low resistivity. The 
energy lost in the collisions appears in the form of heat. 

A form of semiconduction is important in some crys¬ 
tals, such as the silicate minerals. The resistivity of the 
mineral is higher than that of a conductor but lower than 
that of an insulator, and it is called a semiconductor. 
Different types of semiconduction are possible. Silicates 
contain fewer conduction electrons than a metal, but the 
electrons are not rigidly bound to atoms as in an insulator. 
The energy needed to liberate additional electrons from 
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their atoms is not large, and thermal excitation is enough 
to allow them to take part in electronic semiconduction. 
The liberated electron leaves a vacancy or hole in the 
valence level of the atomic structure, which behaves as a 
positive charge. Natural crystals also contain impurity 
atoms, which may have a different valency than that 
required by the lattice for charge balance. The impurity is 
a source of holes or excess electrons which take part in 
impurity semiconduction. At high temperature ions may 
detach from the lattice; they behave like ions in an elec¬ 
trolyte and give rise to electrical currents by ionic semi- 
conduction. A potential difference across a semiconductor 
produces an electric current made up of opposite flows of 
negative electrons and positive holes. If most of the 
current is carried by the negative electrons, the semicon¬ 
ductor is called n-type; if the positive holes predominate, 
the semiconductor is said to be p-type. 

Dielectric conduction occurs in insulators, which 
contain no free electrons. Normally, the electrons are dis¬ 
tributed symmetrically about a nucleus. However, an 
electric field displaces the electrons in the direction oppo¬ 
site to that of the field, while the heavy nucleus shifts 
slightly in the direction of the field. The atom or ion 
acquires an electric polarization and acts like an electric 
dipole. The net effect is to change the permittivity of the 
material from e 0 to a different value e, given by 

n = e r e 0 (4.81) 

Here, s r is called the relative permittivity. When it is mea¬ 
sured in a constant electric field, the relative permittivity is 
called the dielectric constant, k, of the material. It is 
dimensionless, and has a value commonly in the range 
3-80. Examples of k for some natural materials are: air 
1.00059; mica 3; glass 5; sandstone 5-12; granite 3-19; 
diorite 6; basalt 12; water 80. Dielectric effects are unim¬ 
portant in constant current situations. However, in an 
alternating electric field the polarization changes with the 
frequency of the field, and thus the relative permittivity is 
frequency dependent. The fluctuating polarization of the 
electric charge contributes to the alternating current, and 
so modifies the effective conductivity or resistivity. In 
practice, this effect depends strongly on the frequency of 
the inducing alternating field. The higher the frequency, 
the greater is the effect of dielectric conduction. Some geo¬ 
electric methods utilize signals in the audio-frequency 
range, where dielectric conduction is insignificant, but 
ground-penetrating radar uses frequencies in the MHz to 
GHz range and depends on dielectric contrasts. 

Electrolytic conduction occurs in aqueous solutions that 
contain free ions. The water molecule is polar (i.e., it has a 
permanent electric dipole moment) with a strong electric 
field which breaks down molecules of dissolved salts into 
positively and negatively charged ions. For example, in a 
saline solution the molecule of sodium chloride (NaCl) 
dissociates into separate Na + and Cl - ions. The solution is 
called an electrolyte. The ions in the electrolyte are mobi¬ 
lized by an electric field, which causes a current to flow. 


Electric charge is transported by positive ions in the direc¬ 
tion of the field and negative ions in the opposite direction. 
The resistivity of an electrolyte may be understood by 
analogy with the flow of water through a partially blocked 
pipe. The electric current in the electrolyte involves the 
physical transport of material (ions), which results in colli¬ 
sions with the molecules of the medium (electrolyte), 
causing resistance to the flow. Ionic conduction is conse¬ 
quently slower than electronic conduction. 

4.3.3 Electrical properties of the Earth 

In our daily lives we experience frequent reminders of the 
Earth’s gravity field. It is less obvious that the Earth also 
has an electric field. Its presence mainly becomes evident 
during thunderstorms, when electrical discharges take 
place as lightning. The Earth’s electric field acts radially 
inward, so that the Earth behaves like a negatively charged 
sphere. At its surface the vertically downward electric field 
amounts to about 200 Vm -1 . The atmosphere has a net 
positive charge, equal and opposite to that of the Earth, 
and resulting from the distribution of positively and nega¬ 
tively ionized air molecules. The charges originate from the 
continual bombardment of the Earth by cosmic rays. 

Cosmic rays are subatomic particles with very high 
energy. Primary cosmic rays reach the Earth from outer 
space, travelling at velocities close to that of light. They 
consist largely of protons (hydrogen nuclei) and a-particles 
(helium nuclei), with lesser amounts of other ions. Their 
origin is still unknown. Some are emitted by the Sun at the 
time of solar flares, but these occur too infrequently to be 
the main source. This source lies elsewhere in our galaxy. It 
is thought that a large proportion of the galactic cosmic 
rays are accelerated to high speed by supernova explosions. 
The path of a cosmic ray is easily deflected by a magnetic 
field. Even the weak interstellar magnetic field is enough to 
disperse fast-moving cosmic rays, so that they reach the 
Earth equally from all directions. The incoming particles 
collide with nuclei in the upper atmosphere, producing 
showers of secondary cosmic rays, consisting of protons, 
neutrons, electrons and other elementary particles. 
Consequently, at any given time a fraction of the molecules 
of the atmosphere are electrically charged. The Earth’s 
electric field accelerates positive particles downward to the 
Earth’s surface, where they neutralize negative surface 
charges. This would rapidly eliminate the negative surface 
charge, which is maintained by thunderstorm activity. 

Thunderstorms, and the causes of lightning, are not yet 
fully understood. A possible scenario is the following. In a 
storm-cloud, droplets of water vapor become electrically 
charged. The Earth’s downward electric field may cause 
polarization within a droplet, with positive charge on the 
bottom and negative on the top. When the drop is heavy 
enough to fall, negative charges from molecules pushed out 
of its path are attracted to the bottom while fewer positive 
charges gather on the top. As a result the droplet becomes 
negatively charged. The wind action in storm-clouds causes 
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the negative charge to accumulate at the base of the cloud, 
while a corresponding positive charge gathers in its upper 
extent. When the potential difference between the two 
charges exceeds the break-down voltage of the atmosphere, 
a brief but powerful electric current flows. Most lightning 
strokes occur within the storm-cloud. However, the nega¬ 
tive charge on the base of the cloud repels the negative 
charge on the ground surface beneath it. Once again, if the 
potential difference between the cloud and the ground 
becomes large enough to overcome the break-down voltage 
of the air, a lightning stroke ensues. This carries negative 
charge to the ground. In this way, the numerous daily light¬ 
ning storms that occur worldwide maintain the negative 
charge of the Earth. 

To a first approximation the Earth may be regarded as 
a uniform electrical conductor. Electric charges on the 
surface of a conductor disperse so that the electric poten¬ 
tial is the same at all points on the surface, i.e., it is an elec¬ 
trical equipotential surface. The surface potential is 
commonly used as the reference level for electrical poten¬ 
tial energy and is defined to be zero. Thus a positively 
charged body has a positive potential difference (voltage) 
with respect to ground, while a negatively charged body 
has a negative voltage. 

4.3.3.1 Electrical surveying 

As with other physical parameters, geoelectrical properties 
are utilized in both applied and general geophysics. They 
are exploited commercially in the search for valuable ore- 
bodies, which may be located by their anomalous electrical 
conductivities. Deep electrical sounding provides valuable 
information about the internal structure of the Earth's 
crust and mantle. Electrical surveys may be based on 
natural sources of potential and current. More commonly, 
they involve the detection of signals induced in subsurface 
conducting bodies by electric and magnetic fields gener¬ 
ated above ground. Investigations in this category include 
resistivity and electromagnetic methods. These techniques 
have long been used in commercial geophysical surveying. 
In recent years they have also become important in the sci¬ 
entific investigation of environmental problems. The elec¬ 
trical techniques require the measurement of potential 
differences in the ground between suitably implanted elec¬ 
trodes. The electromagnetic techniques detect subsurface 
conductivity anomalies remotely; they do not need contact 
with the ground. As well as being employed in surface 
surveys they are especially suited to airborne use. 

The important physical properties of rocks for electri¬ 
cal surveying are the permittivity (for georadar) and the 
resistivity (or conductivity), on which several techniques 
are based. Anomalies arise, for example, when a good 
conductor (such as a mineralized dike or orebody) is 
present in rocks that have higher resistivities. The resistiv¬ 
ity contrast between orebody and host rock is often large, 
because the resistivities of different rocks and minerals 
vary widely (Fig. 4.41). In metallic ores the resistivity can 
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Fig. 4.41 Ranges of electrical resistivity for some common rocks, soils 
and ores (data source: Ward, 1990; augmented by data from Telford 
eta/., 1990). 

be very low, but igneous rocks that contain no water can 
have a very high resistivity. For example, in a high-grade 
pyrrhotite ore p is of the order of 10 -5 fl m, while in dry 
marble it is around 10 s fl m. The range between these 
extremes spans 13 orders of magnitude. Moreover, the 
resistivity range of any given rock type is wide and over¬ 
laps with other rock types (Fig. 4.41). 

The resistivity of rocks is strongly influenced by the 
presence of groundwater, which acts as an electrolyte. This 
is especially important in porous sediments and sedimen¬ 
tary rocks. The minerals that form the matrix of a rock are 
generally poorer conductors than groundwater, so the 
conductivity of a sediment increases with the amount of 
groundwater it contains. This depends on the fraction of 
the rock that consists of pore spaces (the porosity, <)>), and 
the fraction of this pore volume that is water filled (the 
water saturation , S). The conductivity of the rock is pro¬ 
portional to the conductivity of the groundwater, which is 
quite variable because it depends on the concentration 
and type of dissolved minerals and salts it contains. These 
observations are summarized in an empirical formula, 
called Archie’s law, for the resistivity p of the rock 

P = (4.82) 

By definition </> and S are fractions between 0 and 1, p w is 
the resistivity of the groundwater, and the parameters a, 
m and n are empirical constants that have to be deter¬ 
mined for each case. Generally, 0.5 s a s 2.5,1.3 ^ m < 2.5 
and n = 2. 
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4.3.4 Natural potentials and currents 

Electrical investigations of natural electrical properties are 
based on the measurement of the voltage between a pair of 
electrodes implanted in the ground. Natural differences in 
potential occur in relation to subsurface bodies that create 
their own electric fields. The bodies act like simple voltaic 
cells; their potential arises from electrochemical action. 
Natural currents (called telluric currents) flow in the crust 
and mantle of the Earth. They are induced electromagneti- 
cally by electric currents in the ionosphere (described in 
Section 5.4.3.2). In studying natural potentials and cur¬ 
rents the scientist has no control over the source of the 
signal. This restricts the interpretation, which is mostly 
only qualitative. The natural methods are not as useful as 
controlled induction methods, such as resistivity and elec¬ 
tromagnetic techniques, but they are inexpensive and fast. 

4.3.4.1 Self-potential (spontaneous potential) 

A potential that originates spontaneously in the ground is 
called a self-potential (or spontaneous potential). Some 
self-potentials are due to man-made disturbances of the 
environment, such as buried electrical cables, drainage 
pipes or waste disposal sites. They are important in the 
study of environmental problems. Other self-potentials 
are natural effects due to mechanical or electrochemical 
action. In every case the groundwater plays a key role by 
acting as an electrolyte. 

Some self-potentials have a mechanical origin. When 
an electrolyte is forced to flow through a narrow pipe, a 
potential difference (voltage) may arise between the ends 
of the pipe. Its amplitude depends on the electrical resis¬ 
tivity and viscosity of the electrolyte, and on the pressure 
difference that causes the flow. The voltage is due to 
differences in the electrokinetic or streaming potential, 
which in turn is influenced by the interaction between the 
liquid and the surface of the solid (an effect called the 
zeta-potential). The voltage can be positive or negative 
and may amount to some hundreds of millivolts. This 
type of self-potential can be observed in conjunction with 
seepage of water from dams, or the flow of groundwater 
through different lithological units. 

Most self-potentials have an electrochemical origin. 
For example, if the ionic concentration in an electrolyte 
varies with location, the ions tend to diffuse through the 
electrolyte so as to equalize the concentration. The 
diffusion is driven by an electric diffusion potential, which 
depends on the temperature as well as the difference in 
ionic concentration. When a metallic electrode is inserted 
in the ground, the metal reacts electrochemically with the 
electrolyte (i.e., groundwater), causing a contact poten¬ 
tial. If two identical electrodes are inserted in the ground, 
variations in concentration of the electrolyte cause 
different electrochemical reactions at each electrode. A 
potential difference arises, called the Nernst potential. The 
combined diffusion and Nernst potentials are called the 
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Fig. 4.42 A schematic model of the origin of the self-potential 
anomaly of an orebody. The mechanism depends on differences in 
oxidation potential above and below the water table. 


electrochemical self-potential. It is temperature sensitive 
and may be either positive or negative, amounting to at 
most a few tens of millivolts. 

The self-potentials that originate by the above mecha¬ 
nisms are attracting increased attention in environmental 
and engineering situations. However, in the exploration for 
subsurface regions of mineralization they are often smaller 
than the potentials associated with orebodies and are classi¬ 
fied accordingly as “background potentials.” The self-poten- 
tial associated with an orebody is called its “mineralization 
potential.” Self-potential (SP) anomalies across orebodies 
are invariably negative, amounting usually to a few hundred 
millivolts. They are most commonly associated with sulfide 
ores, such as pyrite, pyrrhotite, and chalcopyrite, but also 
with graphite and some metallic oxides. 

The origin of the mineralization type of self-potential 
is still obscure, despite decades of applied investigations. 
At one time it was thought that the effect arose from gal¬ 
vanic action. This occurs when dissimilar metal electrodes 
are placed in an electrolyte. Unequal contact potentials 
are formed between the metals and the electrolyte, giving 
rise to a potential difference between the electrodes. 
According to this model an orebody behaves like a simple 
voltaic cell, with groundwater acting as the electrolyte. It 
was believed that oxidation of the part of the orebody 
above the water table produced a potential difference 
between the upper and lower parts, causing a spontaneous 
electric polarization of the body. Oxidation involves the 
addition of electrons, so the top of the orebody becomes 
negatively charged, explaining the observed negative 
anomalies. Unfortunately, this simple model does not 
explain many of the observed features of self-potential 
anomalies and has proved to be untenable. 

Another mechanism for self-potential depends on vari¬ 
ations in oxidation (redox) potential with depth (Fig. 4.42). 
The ground above the water table is more accessible to 
oxygen than the submersed part, so moisture above the 
water table contains more oxidized ions than that below it. 
An electrochemical reaction takes place at the surface 
between the orebody and the host rock above the water 
table. It results in reduction of the oxidized ions in the 
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adjacent solution. An excess of negative ions appears 
above the water table. A simultaneous reaction between the 
submersed part of the orebody and the groundwater 
causes oxidation of the reduced ions present in the ground- 
water. This produces excess positive ions in the solution 
and liberates electrons at the surface of the orebody, which 
acts as a conductor connecting the two half-cells. Electrons 
flow from the deep part to the shallow part of the orebody. 
Outside the orebody, positive ions move from bottom to 
top along the electric field lines. The equipotential surfaces 
are normal to the field lines. The self-potential is measured 
where they intersect the ground surface (Fig. 4.42). 

The redox model is inadequate for the same reason as 
the galvanic model; it fails to account for many of the 
observed features of self-potential anomalies. In particu¬ 
lar, the association of self-potential models with the water 
table has been cast in doubt. Moreover, sulfide orebodies 
appear to persist for geological lengths of time, so that a 
mechanism involving permanent flow of charge appears 
unlikely. Self-potential is a feature of a stable system that 
is perturbed by making an electrical connection between 
the host rock and the sulfide conductor through the 
inserted electrodes and their connecting wire. The 
observed potential difference appears to be due to the 
difference in oxidation potential between the locations of 
the measurement electrodes, one inside and the other 
outside the zone of mineralization. 

4.34.2 SP surveying 

The equipment needed for an SP survey is very simple. 
It consists of a sensitive high-impedance digital volt¬ 
meter to measure the natural potential difference between 
two electrodes implanted in the ground. Simple metal 
stakes are inadequate as electrodes. Electrochemical reac¬ 
tions take place between the metal and moisture in the 
ground, causing the build-up of spurious charges on the 
electrodes, which can falsify or obscure the small natural 
self-potentials. To avoid or minimize this effect non-polar- 
izable electrodes are used. Each electrode consists of a 
metal rod submersed in a saturated solution of its own 
salt; a common arrangement is a copper rod in copper 
sulfate solution. The combination is contained in a 
ceramic pot which allows the electrolyte to leak slowly 
through its porous walls, thereby making electrical 
contact with the ground. 

Two field methods are in common use (Fig. 4.43). The 
gradient method employs a fixed separation between the 
electrodes, of the order of 10 m. The potential difference is 
measured between the electrodes, then the pair is moved 
forward along the survey line until the trailing electrode 
occupies the location previously occupied by the leading 
electrode. The total potential at a measurement station rel¬ 
ative to a starting point outside the study area is found 
by summing the incremental potential differences. Some 
electrode polarization is unavoidable, even with non- 
polarizable electrodes. This gives rise to a small error in 
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Fig. 4.43 The field techniques of measuring self-potential by (a) the 
gradient method and (b) the total field method. The total potential V at 
a station in the gradient method is found by summing the previous 
potential differences At/; in the total field method l/is measured directly. 


each measurement; these add up to a cumulative error in 
the total potential. The polarization effects can sometimes 
be reduced by interchanging the leading and trailing elec¬ 
trodes. In this “leapfrog” technique the leading electrode 
for one measurement is kept in place and becomes the trail¬ 
ing electrode for the next measurement; meanwhile the pre¬ 
vious trailing electrode is moved ahead to become the 
leading electrode. Cumulative error is the most serious dis¬ 
advantage of the fixed electrode configuration. A practical 
advantage of the technique is that only a short length of 
connecting wire must be moved along with the electrodes. 

The total field method utilizes a fixed electrode at a 
base station outside the area of exploration and a mobile 
measuring electrode. With this method the total potential 
is measured directly at each station. The wire connecting 
the electrodes has to be long enough to allow good cover¬ 
age of the area of interest. This necessitates a long wire 
that must be wound or unwound on a reel for each mea¬ 
surement station. However, the total field method results 
in smaller cumulative error than the gradient method. It 
allows more flexibility in placing the mobile electrode and 
usually gives data of better quality. Hence, the total field 
method is usually preferred except in difficult terrain. 

The surveying procedure with each technique consists 
of measuring potential at discrete stations along a profile. 
As in gravity and magnetic surveys, the data are mapped 
(Fig. 4.44) and interpretations of anomalies are based on 
their geometry. Methods used to interpret self-potential 
anomalies are often qualitative or are based on simple 
geometric models. Visual inspection of mapped anomalies 
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Fig. 4.44 Hypothetical contour lines of a negative self-potential 
anomaly over an orebody; the asymmetry of the anomaly along the 
profile AB suggests that the orebody dips toward A. 

may reveal trends related to elongation of the orebody; 
crowding of contour lines can indicate its orientation. 
Profiles plotted in known directions across the anomaly 
can be compared with curves generated from simple 
models of the source. For example, a polarized sphere may 
be used to model the source of approximately circular 
anomalies, while a horizontal line source (or polarized 
cylinder) may be used to model an elongate anomaly. A 
common and effective method is to model SP anomalies 
with point sources; complex anomalies are modelled with 
combinations of sources and sinks. 

43.4.3 Telluric currents 

Ultraviolet radiation from the Sun ionizes molecules of air 
in the thin upper atmosphere of the Earth. The ions accu¬ 
mulate in several layers, forming the ionosphere (see 
Section 5.4.3.2) at altitudes between about 80 km and 
1500 km above the Earth’s surface. Electric currents in the 
ionosphere arise from systematic motions of the ions, 
which are affected by various factors such as the daily and 
monthly tides, seasonal variations in insolation and the 
periodic fluctuation in ionization related to the 11 -yr 
sunspot cycle. The currents produce varying magnetic 
fields with the same frequencies, which are observed at the 
surface of the Earth and can be analyzed from long-term 
continuous records of the geomagnetic field. The ionos¬ 
pheric effects show up in the energy spectrum of the geo¬ 
magnetic field as distinct peaks representing periods that 
range from fractions of a second (geomagnetic pulsations) 
to several years (Fig. 4.45). The magnetic fields induce 



Fig. 4.45 (a) The frequency spectrum of natural variations in the 
horizontal intensity of the geomagnetic field, and (b) the corresponding 
spectrum of induced electric field fluctuations, computed for a model 
Earth with uniform resistivity 20 ft m (after Serson, 1973). 

fluctuating electric currents, called telluric currents , that 
flow in horizontal layers in the crust and mantle. The 
current pattern consists of several huge whorls, thousands 
of kilometers across, which remain fixed with respect to 
the Sun and thus move around the Earth as it rotates. 

The distribution of telluric current density depends on 
the variation of resistivity in the horizontal conducting 
layers. At shallow crustal depths the lines of current flow 
are disturbed by subsurface structures which cause con¬ 
trasts in resistivity. These could arise from geological struc¬ 
tures or the presence of mineralized zones. Consider, for 
example, a buried anticline which has a highly resistive 
rock (such as granite) as its core and is overlain by a con¬ 
ducting layer of porous sedimentary rocks saturated with 
groundwater. The horizontal flow of telluric current across 
the anticline chooses the less-resistive path through the 
conducting sediments. The current lines bunch together 
over the axis of the anticline, increasing the horizontal 
current density (Fig. 4.46). The equipotential surfaces 
normal to the current lines intersect the ground surface, 
where potential differences can be measured with a 
high-impedance voltmeter. 

The field equipment for measuring telluric current 
density is simple. The sensors are a pair of non-polariz- 
able electrodes with a fixed separation L of the order of 
10-100 m. The potential difference V between the elec¬ 
trodes is measured with a high-impedance voltmeter. The 
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Fig. 4.46 Telluric current lines are deflected by changes in thickness of 
a conducting layer over a more resistive structure (bottom). The telluric 
current density (top) is obtained from the voltage measured between a 
pair of fixed-separation electrodes at the surface (after Robinson and 
goruh, 1988). 


electric field E at a point mid-way between the electrodes 
can be assumed to be VIL. Using Ohm's law (Eq. (4.80)) 
and assuming that the telluric current flows in conducting 
rock layer with resistivity p v the telluric current density J 
at each measurement station along a profile is given by 



(4.83) 


The direction of the telluric current is not known, so 
two pairs of electrodes oriented perpendicular to each 
other are used. One pair is aligned north-south, the other 
east-west. Telluric currents vary unpredictably with time, 
but they change only slowly within a homogeneous 
region. To keep track of the temporal changes an orthog¬ 
onal pair of electrodes is set up at a fixed base station 
outside the area to be explored. Another orthogonal pair 
is moved across the survey area. The potential differences 
across each electrode pair in the mobile and base arrays 
are recorded simultaneously for several minutes at each 
measurement station. Correlation of the records allows 
removal of the temporal changes in direction and inten¬ 
sity of the telluric currents. 

The deflection of telluric current by a resistive subsur¬ 
face structure as shown in Fig. 4.46 is greatly idealized. 
It assumes an infinite resistivity p 1 in the core of the 
anticline. In practice, the current is not completely diverted 
through the better-conducting layer; part flows through 
the more resistive layer as well. Thus we cannot assume 
that the resistivity p x in Eq. (4.83) corresponds to the good 
conductor. Rather, it represents some undefined mixture of 
the values p, and p 1 . It is not the true resistivity of either 
layer, but the apparent resistivity of the measurement. 


4.3.5 Resistivity surveying 

The large contrast in resistivity between orebodies and 
their host rocks (see Fig. 4.41) is exploited in electrical 
resistivity prospecting, especially for minerals that occur 
as good conductors. Representative examples are the 
sulfide ores of iron, copper and nickel. Electrical resistiv¬ 
ity surveying is also an important geophysical technique 
in environmental applications. For example, due to the 
good electrical conductivity of groundwater the resistiv¬ 
ity of a sedimentary rock is much lower when it is water¬ 
logged than in the dry state. 

Instead of relying on natural currents, two electrodes 
are used to supply a controlled electrical current to the 
ground. As in the telluric method, the lines of current flow 
adapt to the subsurface resistivity pattern so that the 
potential difference between equipotential surfaces can be 
measured where they intersect the ground surface, using a 
second pair of electrodes. A simple direct current can cause 
charges to accumulate on the potential electrodes, which 
results in spurious signals. A common practice is to com¬ 
mutate the direct current so that its direction is reversed 
every few seconds; alternatively a low-frequency alternat¬ 
ing current may be used. In multi-electrode investigations 
the current electrode-pair and potential electrode-pair are 
usually interchangeable. 

4.3.5.1 Potential of a single electrode 

Consider the flow of current around an electrode that 
introduces a current / at the surface of a uniform half¬ 
space (Fig. 4.47a). The point of contact acts as a current 
source, from which the current disperses outward. The 
electric field lines are parallel to the current flow and 
normal to the equipotential surfaces, which are hemi¬ 
spherical in shape. The current density J is equal to I 
divided by the surface area, which is 2ttr 2 for a hemi¬ 
sphere of radius r. The electric field E at distance r from 
the input electrode is obtained from Ohm’s law (Eq. 
(4.80)) 

E = pJ=P 2tt? (4 ' 84) 

Putting this expression in Eq. (4.74) yields the electric 
potential U at distance r from the input electrode: 

d U = _ I 
dr P 2ttr 2 

u =e^ ' 4 85 » 

If the ground is a uniform half-space, the electric field 
lines around a source electrode, which supplies current to 
the ground, are directed radially outward (Fig. 4.47b). 
Around a sink electrode, where current flows out of the 
ground, the field lines are directed radially inward (Fig. 
4.47c). The equipotential surfaces around a source or 
sink electrode are hemispheres, if we regard the electrode 
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Fig. 4.48 General four-electrode configuration for resistivity 
measurement, consisting of a pair of current electrodes (A, B) and a pair 
of potential electrodes (C, D). 



All quantities in this equation can be measured at the 
ground surface except the resistivity, which is given by 
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Fig. 4.47 Electric field lines and equipotential surfaces around a single 
electrode at the surface of a uniform half-space: (a) hemispherical 
equipotential surfaces, (b) radially outward field lines around a source, 
and (c) radially inward field lines around a sink. 

in isolation. The potential around a source is positive and 
diminishes as 1/r with increasing distance. The sign of /is 
negative at a sink, where the current flows out of the 
ground. Thus, around a sink the potential is negative and 
increases (becomes less negative) as 1/r with increasing 
distance from the sink. We can use these observations to 
calculate the potential difference between a second pair 
of electrodes at known distances from the source and 
sink. 


43.5.3 Special electrode configurations 

The general formula for the resistivity measured by a four- 
electrode method is simpler for some special geometries of 
the current and potential electrodes. The most commonly 
used configurations are the Wenner, Schlumberger and 
double-dipole arrangements. In each configuration the 
four electrodes are collinear but their geometries and spac- 
ings are different. 

In the Wenner configuration (Fig. 4.49a) the current 
and potential electrode pairs have a common mid-point 
and the distances between adjacent electrodes are equal, 
so that r AC = r DB = a , and r CB = r AD = 2a. Inserting these 
values in Eq. (4.89) gives 


43.5.2 The general four-electrode method 

Consider an arrangement consisting of a pair of current 
electrodes and a pair of potential electrodes (Fig. 4.48). 
The current electrodes A and B act as source and sink, 
respectively. At the detection electrode C the potential due 
to the source A is +pI/(2irr AC ), while the potential due to 
the sink B is -pH(2'nr CB ). The combined potential at C is 
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Similarly, the resultant potential at D is 
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In the Schlumberger configuration (Fig. 4.49b) the 
current and potential pairs of electrodes often also have a 
common mid-point, but the distances between adjacent 
electrodes differ. Let the separations of the current and 
potential electrodes be L and a, respectively. Then r AC = 
r DB = (L - a)l2 and r AD = r CB = (L + a)l2. Substituting in 
the general formula, we get 
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The potential difference measured by a voltmeter con¬ 
nected between C and D is 


tt v( L 1 — q 2 \ 

4 1\ a j 


(4.92) 
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In this configuration the separation of the current electro- 
(4.88 ) des is kept much larger than that of the potential electrodes 
(. L>a ). Under these conditions, Eq. (4.92) simplifies to 
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Fig. 4.49 Special geometries of current and potential electrodes for (a) 
Wenner, (b) Schlumberger and (c) double-dipole configurations. 


d = vV(lI\ (4.93) 

p 4 I\aj 

In the double-dipole configuration (Fig. 4.49c) the 
spacing of the electrodes in each pair is a, while the dis¬ 
tance between their mid-points is L , which is generally 
much larger than a. Note that detection electrode D is 
defined as the potential electrode closer to current sink B. 
In this case r AD = r BC = L, r AC = L + a, and r BD = L — a. 
The measured resistivity is 

p-^|(i-r±i)-(r±i-i)] 

p = ^( L( ^~ fl2) ) < 4 ' 95 > 

Two modes of investigation can be used with each elec¬ 
trode configuration. The Wenner configuration is best 
adapted to lateral profiling. The assemblage of four elec¬ 
trodes is displaced stepwise along a profile while maintain¬ 
ing constant values of the inter-electrode distances 
corresponding to the configuration employed. The separa¬ 
tion of the current electrodes is chosen so that the current 
flow is maximized in depths where lateral resistivity 
contrasts are expected. Results from a number of profiles 
may be compiled in a resistivity map of the region of inter¬ 
est. The regional survey reveals the horizontal variations in 
resistivity within an area at a particular depth. It is best 
suited to locating steeply dipping contacts between 
rocks with a strong resistivity contrast and good conduc¬ 


source sink surface 



Fig. 4.50 Cross-section of current "tubes" and equipotential surfaces 
between a source and sink; numbers on the current lines indicate the 
fraction of current flowing above the line (after Robinson and (Joruh, 
1988; based upon Van Nostrand and Cook, 1966). 

tors such as mineralized dikes, which may be potential ore- 
bodies. 

In vertical electrical sounding ( VES ) the goal is to 
observe the variation of resistivity with depth. The tech¬ 
nique is best adapted to determining depth and resistivity 
for flat-lying layered rock structures, such as sedimentary 
beds, or the depth to the water table. The Schlumberger 
configuration is most commonly used for VES investiga¬ 
tions. The mid-point of the array is kept fixed while the 
distance between the current electrodes is progressively 
increased. This causes the current lines to penetrate to 
ever greater depths, depending on the vertical distribution 
of conductivity. 


43.5.4 Current distribution 

The current pattern in a uniform half-space extends later¬ 
ally on either side of the profile line. Viewed from above, 
the current lines bulge outward between source and sink 
with a geometry similar to that shown in Fig. 4.39c. In a 
vertical section the current lines resemble half of a dipole 
geometry. In three dimensions the current can be visual¬ 
ized as flowing through tubes that fatten as they leave the 
source and narrow as they converge towards the sink. 
Figure 4.50 shows the flow pattern of the current in a ver¬ 
tical section through the “tubes” in a uniform half-space. 

In order to evaluate the depth penetration of current 
in a uniform half-space we define orthogonal Cartesian 
coordinates with the x-axis parallel to the profile and the 
c-axis vertical (Fig. 4.51a). Let the spacing of the current 
electrodes be L and the resistivity of the half-space be p. 
The horizontal electric field E x at (x, y, z) is 


di i = 

dX ax[2TTV'l r 2 )j 


(4.96) 


where r x = (x 2 + y 2 + z 2 ) m and r 2 = ((L - x) 2 + y 1 + z 2 ) m . 
Differentiating and using Ohm’s law (Eq. (4.80)) gives the 
horizontal current density J x at (x, y, z)\ 
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Fig. 4.51 (a) Geometry for determining current density in uniform 
ground below two electrodes, and (b) fraction of current (I/I) that flows 
above depth z across the median plane between current electrodes 
with spacing L (after Telford etal., 1990). 


J = 


I / yv 

2TT\r3 


L — 
i ) 


(4.97) 


If (x, y, z) is on the vertical plane mid-way between the 
current electrodes, x = LI 2, r x = and the current density 
is given by 


j _ IL _ 1 _ 

* 2tt ((L/2) 2 + y 2 + z 2 ) 3/2 


(4.98) 


The horizontal current d/ v across an element of area 
(dydz) in the median vertical plane is d/ Y = J x dv dr. 
The fraction of the input current / that flows across 
the median plane above a depth z is obtained by inte¬ 
gration: 


if, Y d y 

2rrJ o _Jj(L/2) 2 + y 2 + z 2 ) 3/2 

(4.99) 

L T dz 

77 { ((L/2) 2 + z 2 ) 

(4.100) 

2. _i2z 

= —tan 
"IT L 

(4.101) 


Equation (4.101) shows that I x depends upon the current- 
electrode spacing L (Fig. 4.51b). Half the current crosses 
the plane above a depth z = LI 2, and almost 90% passes 
above the depth z = 3L. The fraction of current between 
any two depths is found from the difference in the frac¬ 
tions above each depth calculated with Eq. (4.101). 


(a) electrode configuration 



(b) current distributions 




(c) apparent resistivity 



Fig. 4.52 (a) Parameters of the four-electrode arrangement, (b) 
distribution of current lines in a two-layer ground with resistivities p, 
and p 2 (p, >p 2 ) and (c) the variation of apparent resistivity as the current 
electrode spacing is varied for the two cases of p, >p 2 and p 1 <p 2 . 


43.5.5 Apparent resistivity 

In the idealized case of a perfectly uniform conducting 
half-space the current flow lines resemble a dipole pattern 
(Fig. 4.50), and the resistivity determined with a four-elec¬ 
trode configuration is the true resistivity of the half-space. 
But in real situations the resistivity is determined by 
different lithologies and geological structures and so may 
be very inhomogeneous. This complexity is not taken into 
account when measuring resistivity with a four-electrode 
method, which assumes that the ground is uniform. The 
result of such a measurement is the apparent resistivity of 
an equivalent uniform half-space and generally does not 
represent the true resistivity of any part of the ground. 

Consider a horizontally layered structure in which a 
layer of thickness d and resistivity p l overlies a conducting 
half-space with a lower resistivity p 1 (Fig. 4.52). If the 
current electrodes are close together, so that L<d, all or 
most of the current flows in the more resistive upper layer, 
so that the measured resistivity is close to the true value of 
the upper layer, p l . With increasing separation of the 
current electrodes the depth reached by the current lines 
increases. Proportionally more current flows in the less 
resistive layer, so the measured resistivity decreases. 
Conversely, if the upper layer is a better conductor 
than the lower layer, the apparent resistivity increases with 
























264 Earth's age, thermal and electrical properties 


increasing electrode spacing. When the electrode separa¬ 
tion is much larger than the thickness of the upper layer 
( L>d) the measured resistivity is close to the value p 2 of 
the bottom layer. Between the extreme situations the 
apparent resistivity determined from the measured current 
and voltage is not related simply to the true resistivity of 
either layer. 


43.5.6 Vertical electrical sounding 

A two-layer situation is encountered often in electrical 
prospecting, for example when a conducting overburden 
overlies a resistive basement. It is also common in environ¬ 
mental applications, when the conducting water table lies 
under drier, more resistive soil or rocks. Before the advent 
of portable computers, two-layer cases were interpreted 
with the aid of characteristic curves. These theoretical 
curves, calculated for a particular four-electrode array, 
take into account the change in depth penetration when 
current lines cross the boundary to a layer with different 
resistivity. The electrical boundary conditions require con¬ 
tinuity of the component of current density J normal to 
the interface and of the component of electric field E tan¬ 
gential to the interface. At a boundary the current lines 
behave like optical or seismic rays, and are guided by 
similar laws of reflection and refraction. For example, if 0 
is the angle between a current line and the normal to the 
interface, the electrical “law of refraction” is 


tan0j p 2 
tan0 2 - Pi 


(4.102) 


In a set of characteristic curves the apparent resistivity 
p a is normalized by the resistivity p t of the upper layer and 
the electrode spacing is expressed as a multiple of the layer 
thickness. The shape of the curve of apparent resistivity 
versus electrode spacing depends on the resistivity con¬ 
trast between the two layers, and a family of characteristic 
curves is calculated for different ratios of p 2 lp l (Fig. 4.53). 
The resistivity contrast is conveniently expressed by a k- 
factor defined as 


Pi _Pi 

P 2 + Pi 


(4.103) 


The A'-factor ranges between -1 and +1 as the resistivity 
ratio p 2 /pj varies between 0 and The characteristic 
curves, drawn as full logarithmic plots on a transparent 
overlay, are compared graphically with the field data to 
find the best-fitting characteristic curve. The comparison 
yields the resistivities p x and p 1 of the upper and lower 
layers, respectively, and the layer thickness, d. 

Although characteristic curves can also be computed 
for the interpretation of structures with multiple horizon¬ 
tal layers, modern VES analyses take advantage of the 
flexibility offered by small computers with graphic 
outputs on which the apparent resistivity curves can be 
assessed visually. The first step in the analysis consists of 
classifying the shape of the vertical sounding profile. 



Fig. 4.53 Characteristic curves of apparent resistivity for a two-layer 
structure using the Wenner array; parameters are defined in the inset. 


The apparent resistivity curve for a three-layer structure 
generally has one of four typical shapes, determined by the 
vertical sequence of resistivities in the layers (Fig. 4.54). 
The type K curve rises to a maximum then decreases, indi¬ 
cating that the intermediate layer has higher resistivity 
than the top and bottom layers. The type H curve shows 
the opposite effect; it falls to a minimum then increases 
again due to an intermediate layer that is a better conduc¬ 
tor than the top and bottom layers. The type A curve may 
show some changes in gradient but the apparent resistivity 
generally increases continuously with increasing electrode 
separation, indicating that the true resistivities increase 
with depth from layer to layer. The type Q curve exhibits 
the opposite effect; it decreases continuously along with a 
progressive decrease of resistivity with depth. 

Once the observed resistivity profile has been identi¬ 
fied as of K, H, A or Q type, the next step is equivalent to 
one-dimensional inversion of the field data. The technique 
involves iterative procedures that would be very time-con¬ 
suming without a fast computer. The method assumes the 
equations for the theoretical response of a multi-layered 
ground. Each layer is characterized by its thickness and 
resistivity, each of which must be determined. A first esti¬ 
mate of these parameters is made for each layer and the 
predicted curve of apparent resistivity versus electrode 
spacing is computed. The discrepancies between the 
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Fig. 4.54 The four common 
shapes of apparent resistivity 
curves for a layered structure 
consisting of three horizontal 
layers. 


(a) type K p ? 


Pi <P2>P3 
A Pa 





Effective electrode spacing 



Effective electrode spacing 


(d) type Q 

Pi > Pi > P3 

A Pa 



I 



observed and theoretical curves are then determined 
point by point. The layer parameters used in the govern¬ 
ing equations are next adjusted, and the calculation is 
repeated with the corrected values, giving a new predicted 
curve to compare with the field data. Using modern com¬ 
puters the procedure can be reiterated rapidly until the 
discrepancies are smaller than a pre-determined value. 

The inversion method is equivalent to matching auto¬ 
matically the observed and theoretical curves. A one¬ 
dimensional analysis accommodates only the variations 
of resistivity and layer thickness with depth. The response 
of a vertically layered structure has an analytical solu¬ 
tion, so efficient inversion algorithms can be established. 
In recent years, procedures have been proposed that also 
take into account lateral heterogeneities. The response of 
two- or three-dimensional structures must be approxi¬ 
mated by a numerical solution, based on the finite- 
difference or finite-element techniques. The number of 
unknown quantities increases, as do the computational 
difficulties of the inversion. 

43.5.7 Induced polarization 

If commutated direct current is used in a four-electrode 
resistivity survey, the sequence of positive and negative flow 
may be interspersed with periods when the current is off. 
The inducing current then has a box-like appearance (Fig. 
4.55a). When the current is interrupted, the voltage across 
the potential electrodes does not drop immediately to zero. 
After an initial abrupt drop to a fraction of its steady-state 
value it decays slowly for several seconds (Fig. 4.55b). 
Conversely, when the current is switched on, the potential 
rises suddenly at first and then gradually approaches the 
steady-state value. The slow decay and growth of part of 


(a) inducing current 
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(b) measured potential 




Fig. 4.55 (a) Illustration of the IP-related decay of potential after 
interruption of the primary current, (b) Effect of the IP decay 
time on the potential waveform for a square-wave input current. 

the signal are due to induced polarization, which results 
from two similar effects related to the rock structure: mem¬ 
brane polarization and electrode polarization. 
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Membrane polarization is a feature of electrolytic con¬ 
duction. It arises from differences in the ability of ions in 
pore fluids to migrate through a porous rock. The miner¬ 
als in a rock generally have a negative surface charge and 
thus attract positive ions in the pore fluid. They accumu¬ 
late on the grain surface and extend into the adjacent 
pores, partially blocking them. When an external voltage 
is applied, positive ions can pass through the “cloud" of 
positive charge but negative ions accumulate, unless the 
pore size is large enough to allow them to bypass the 
blockage. The effect is like a membrane, which selectively 
allows the passage of one type of ion. It causes temporary 
accumulations of negative ions, giving a polarized ionic 
distribution in the rock. The effect is most pronounced in 
rocks that contain clay minerals; firstly, because the grain 
and pore sizes are small, and, secondly, because clay 
grains are relatively strongly charged and adsorb ions on 
their surfaces. The ionic build-up takes a short time after 
the voltage is switched on; when the current is switched 
off, the ions drift back to their original positions. 

Electrode polarization is a similar effect that occurs 
when ore minerals are present. The metallic grains conduct 
charge by electronic conduction, while electrolytic conduc¬ 
tion takes place around them. However, the flow of elec¬ 
trons through the metal is much faster than the flow of ions 
in the electrolyte, so opposite charges accumulate on facing 
surfaces of a metallic grain that blocks the path of ionic 
flow through the pore fluid. An overvoltage builds up for 
some time after the external current is switched on. The 
size of the effect is commensurate with the metallic concen¬ 
tration. After the current is switched off, the accumulated 
ions disperse and the overvoltage decays slowly. 

The two effects responsible for induced polarization are 
indistinguishable at measurement level. The field method 
for an induced polarization (IP) survey is most often based 
on the double-dipole array. The current electrodes form a 
transmitter pair, while the potential electrodes form a 
receiver pair. The steady-state voltage V () is recorded and 
compared with the amplitude of the decaying residual 
voltage V(t) at time t after the current is interrupted (Fig. 
4.55c). The ratio V(t)/V 0 is expressed as a percentage, 
which decays during the 0.1-10 s between switching the 
current on and off. If the decay curve is sampled at many 
points, its shape and the area under the curve may be 
obtained (Fig. 4.55d). The area under the decay curve, 
expressed as a fraction of the steady-state voltage, is called 
the chargeability M, defined as 

M=4r\v(t) dt (4.104) 

°{ 

M has the dimensions of time and is expressed in seconds 
or milliseconds. It is the most commonly used parameter 
in IP studies. 

The induced polarization determines the length of the 
potential decay time. If it is shorter than the time when the 
inducing current is off, successive half-cycles of the poten¬ 


tial will not interfere. However, if a disseminated conductor 
is present, the decay time increases, causing overlap and dis¬ 
tortion of the half-cycles. The higher the signal frequency 
the more pronounced is the effect. It increases the ratio 
V(t)/V 0 , giving the impression of a better conductor than is 
really present (i.e., the apparent resistivity decreases with 
increasing frequency). Clearly, IP and resistivity surveys 
with alternating current are also influenced. The frequency 
dependence of the IP effect is exploited by measuring 
apparent resistivity at two low frequencies. Let these be / 
and F(>f). Commonly/=»().05 —0.5Hz and F~ l-10Hz. 
Then p f > p F and we can define a frequency effect as 

Pf~ Pf 

FE = ~pf~ (4.105) 

The ratio FE is often multiplied by 100 to express it as a 
percentage (PFE). If no IP effect is present the resistivity 
will be the same at both frequencies. The larger the value 
of FE or PFE, the greater is the induced polarization in 
the ground. At frequencies above 10 Hz, mutual induc¬ 
tance effects between the cables of the primary and detec¬ 
tion circuits can produce troublesome potentials, which 
must be avoided by the field procedure (such as restricting 
F ) or minimized analytically. 

The presence of metallic conductors is expressed by a 
similar parameter to FE, the metallic factor (MF). This is 
proportional to the difference in conductivities at the two 
measurement frequencies. 

MF = A(a F -cr f ) = A(p F l - pf 1 ) 

=y *(w) <410< ” 

The constant A is equal to 2 tt X 10 5 ; the units of MF are 
those of conductivity (i.e., II -1 m -1 or S m _1 ). 

An IP survey includes both lateral profiling and vertical 
sounding with the expanding spread method. Using a 
double-dipole array the distance between nearest elec¬ 
trodes of the transmitter and receiver pairs is a multiple 
( na ) of the electrode spacing a in each pair. Measurements 
are made at several discrete positions as the receiver pair is 
moved incrementally away from the fixed transmitter pair. 
The transmitter pair is then moved by one increment along 
the profile and the procedure is repeated. The value of p a , 
FE or MF obtained in each measurement is plotted below 
the mid-point of the array at the intersection of two lines 
inclined at 45° (Fig. 4.56a). Information is obtained from 
increasingly greater depths as the transmitter-receiver 
array expands (i.e., as n increases). The plotted value is, 
however, not the real value of the parameter at the indi¬ 
cated depth. (Recall, for example, that a measurement of 
apparent resistivity represents an equivalent half-space 
beneath the array.) A two-dimensional picture of the varia¬ 
tion of the IP parameter beneath the profile is synthesized 
by contouring the results (Fig. 4.56b). The plot is called a 
pseudo-section-, it provides a convenient (though artificial) 
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Fig. 4.56 (a) Construction of a pseudo-section for a double-dipole IP 
survey: the measured parameter is plotted at the intersection of 45° 
lines extending from the mid-points of the transmitter and receiver 
pairs, (b) Pseudo-sections of apparent resistivity and metallic factor for 
an IP survey over a sulfide orebody (redrawn from Telford etal., 1990). 


image of the presence of anomalous conductors, but does 
not represent their true lateral or vertical extent. The pres¬ 
ence of anomalous regions may be investigated further by 
exploratory drilling. 

Resistivity anomalies depend on the presence of con¬ 
tinuous conductors, such as groundwater or massive ore- 
bodies. If mineralization is disseminated through a rock it 
may not cause a significant resistivity anomaly. The good 
response of the IP method for disseminated concentra¬ 
tions of conducting ore minerals led to its development in 
base-metal exploration, where large low-grade orebodies 
may be commercially important. However, the IP effect 
also depends on the porosity and saturation of the rock. 
As a result, it can also be used in the search for groundwa¬ 
ter and in other environmental applications. 

43.5.8 Electrical resistivity tomography 

The availability of fast, inexpensive computers and 
the development of efficient algorithms has led to the 


development of electrical tomographic methods akin to 
the technique of seismic tomography described in Section 
3.7.6. Seismic tomography based upon teleseismic arrivals 
from earthquakes is used to describe regions deep in the 
Earth’s mantle that have anomalous seismic velocities. 
Seismic tomography using refracted and reflected signals 
from controlled sources can likewise be used to describe 
velocity perturbations due to shallow features in the 
Earth’s crust. In a similar way, electrical tomography is 
used to describe the resistivity structure of near-surface 
regions to depths of several tens of meters. In seismic 
tomography the observed travel-times are inverted to 
obtain the velocity structure along the path of a seismic 
ray. Analogously, in resistivity tomography, an inversion 
procedure is applied to the electrical potentials measured 
between electrode pairs to obtain the resistivity structure 
along the current flow lines. 

The methods of direct-current resistivity and induced 
polarization are readily adapted to tomographic analysis. 
Instead of deploying a single pair of current-electrodes 
and a single pair of potential-electrodes, an array of regu¬ 
larly spaced electrodes is deployed. For two-dimensional 
surveys a linear arrangement of electrodes is used; for 
three-dimensional investigations the electrodes form an 
areal array. Various combinations of current-electrode 
pairs and potential-electrode pairs are analyzed. The 
inversion computation is both complex and computer 
intensive. It yields a two- or three-dimensional vertical 
cross-section of the true resistivities beneath the electrode 
array. As in standard resistivity methods, the resolution 
and maximum depth of investigation depend on the sepa¬ 
ration and geometry of the electrodes. 

The application of direct-current resistivity tomogra¬ 
phy to an environmental problem is illustrated by an 
investigation of the extent of ice and permafrost in a 
buried Alpine rock glacier. The Murtel rock glacier on 
Mt. Corvatsch in Switzerland is a creeping, permanently 
frozen (permafrost) body. Its vertical structure is known 
from a drillhole through the body down to about 50 m 
depth below the surface. A resistivity tomographic survey 
employing more than 30 electrodes in Wenner configura¬ 
tions gave a vertical profile of resistivity in good agree¬ 
ment with the drillhole results, and described the lateral 
extent of the subsurface structure of the permafrost body 
in detail (Fig. 4.57). Ice has a much higher resistivity than 
sand or gravel. High resistivities of about 2 Mil m, corre¬ 
sponding to massive ice, are found between 5 m and 15 m 
depths in the rock glacier. Above and below the ice body, 
the resistivities are lower. The surface layer resistivity of 
~ 10 kll m is two orders of magnitude less than in the ice. 
The region below the ice body is interpreted to consist of 
frozen sand containing about 30% ice. Its resistivity is an 
order of magnitude lower than in the ice, but the lower 
boundary of the ice-block is not clearly defined. 
Resistivities are less than 5kllm in front of the rock 
glacier, marking the sharp transition to permafrost-free 
material. 
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Fig. 4.57 Electrical resistivity tomogram of the Murtel rock glacier, Mt. 
Corvatsch, Switzerland. The column on the left shows the vertical 
structure obtained from a borehole at the top of the resistivity profile. 
Solid lines delineate the bounds of the highly resistive, massive ice body. 
Dashed lines indicate the interpreted vertical boundary between the ice 
body and the permafrost-free material ahead of the glacier (after Hauck 
and Von der Muhll, 2003). 





Fig. 4.58 (a) An electric field E is generated by a changing magnetic 
field (dB/dt), while (b) a magnetic field B is produced by the current 
density J and the changing displacement-current density (<9D/dt); (c) in 
an electromagnetic wave an electric field E x and a magnetic field B y 
fluctuate normal to each other in the plane normal to the propagation 
direction (z-axis). 


4.3.6 Electromagnetic surveying 

The pioneering observations of electrical and magnetic 
phenomena early in the nineteenth century by Coulomb, 
Oersted, Ampere, Gauss and Faraday were unified in 
1873 by the Scottish mathematical physicist James Clerk 
Maxwell (1831-1879). His achievement is of similar 
stature to that of Newton in gravitation or Einstein in rel¬ 
ativity. Like Newton, Maxwell gathered existing knowl¬ 
edge and unified it in a way that allowed the prediction of 
other phenomena. His book A Treatise on Electricity and 
Magnetism was as important as Newton’s Principia to the 
further development of physics. Just as all discussions in 
dynamics start with Newton’s laws, all arguments in elec¬ 
tromagnetism begin with Maxwell’s equations. In partic¬ 
ular, he proposed the theory of the electromagnetic field, 
which classifies light as an electromagnetic phenomenon 
in the same sense as electricity and magnetism. This ulti¬ 
mately led to the recognition of the wave nature of matter. 
Unfortunately, Maxwell died while still in the prime of his 
career, before his theoretical predictions were verified. 
The German physicist Heinrich Hertz established the 
existence of electromagnetic waves experimentally in 
1887, eight years after Maxwell’s untimely death. 

Coulomb’s law shows that an electric charge is sur¬ 
rounded by an electric field, which exerts forces on other 
charges, causing them to move, if they are free to do so. 
Ampere’s law shows that an electric charge (or current) 
moving in a conductor produces a magnetic field propor¬ 
tional to the speed of the charge. If the electric field 
increases, so that the charge is accelerated, its changing 
velocity produces a changing magnetic field, which in turn 


induces another electric field in the conductor (Faraday’s 
law) and thereby influences the movement of the acceler¬ 
ated charge. The coupling of the electric and magnetic 
fields is called electromagnetism. If two straight conduc¬ 
tors are laid end-to-end and connected in series, they act 
as an electrical dipole. An alternating electric field applied 
to the conductors causes the dipole to oscillate, acting as 
an antenna for the emission of an electromagnetic wave. 
This consists of a magnetic field B and an electric field E, 
which vary with the frequency of the oscillator, and are 
oriented at right angles to each other in the plane perpen¬ 
dicular to the direction of propagation (Fig. 4.58). In a 
vacuum all electromagnetic waves travel at the speed of 
light (c = 2.99792458X 10 8 ms -1 , about 300,000kms -1 ), 
which is one of the fundamental constants of nature. 

The derivation of electromagnetic field equations from 
Maxwell’s equations is beyond the level of this textbook, 
but their meaning can be readily understood. Two equa¬ 
tions, identical in form, are obtained. They describe the 
propagation of the B and E field vectors, respectively, and 
are written as: 


V7' , TJ dB d 2 B 

V-B = p r p 0 o-jj- + A t r A t 0 e r E cf^2~ 

4E d 2 E 

V-E = p,p 0 o-^ + p T p 0 B r e 0 -^y 

where, in Cartesian coordinates. 


dx 2 dy 2 dz 2 


(4.107) 

(4.108) 







4.3 GEOELECTRICITY 


269 


Frequency 
(Hz) - 


y-rays 




10—I x-rays 


visible 

ligh V''io^ 


(1 pHz) 10 


|_ ^ A = 400 nm violet 

/' A = 550 nm green- 
yellow 

\ A = 700 nm red 


(10 GHz) 10 


I 


micro- 

waves 


100 MHz to 1 GHz 
GPR = ground 
penetrating radar 


—10* 


10- 


(150 kHz) • 


radar 

GPR 


radio+ 

IV 


j 


10“—| EM 

induction 


H 

10 ‘ 


10 ' 


-8 


magneto- 

tellurics 


diurnal & 
secular 
variations 


Wavelength 
12 (m) 


- 10 ' 


- 10 “ 


- 10 “' 


-10- 4 (100 qm). 


-10 “ (1 cm) 


X = 0.3 mm 


X = 3 cm 


-lm 


-10 z (100 m) 


X = 2 km 


-10 (10 km) 


-10° (1000 km) 

Period 

-1 s 

-—1000 s 
-1 day 

-_--l yr 


11 yr 


Fig. 4.59 The electromagnetic spectrum, showing the frequency and 
wavelength ranges of some common phenomena and the frequencies 
and periods used in electromagnetic surveying. 


In these equations a is the electrical conductivity, g r is 
the magnetic permeability, which in most materials (unless 
they are ferromagnetic) is very close to 1, /x () is the perme¬ 
ability constant (Section 5.2.2.1), or permeability of free 
space (p, 0 = 4tt X 10“ 7 N A -2 ), e r is the relative permittivity 
of the material, and e 0 is the permittivity constant, or per¬ 
mittivity of free space (e 0 = 8.854187 X 10“ 12 C 2 N _1 m“ 2 ). 
In constant electrical fields, e r is known as the dielectric 
constant, k. The value of k is— 5-20 in most rocks and 
minerals and 80 in water (Section 4.3.2.3). In sediments 
and sedimentary rocks, the water content plays an impor¬ 
tant role in determining the value of k. The value of e r 
increases with the frequency of the electrical field. 

Electromagnetic radiation encompasses a wide fre¬ 
quency spectrum. It extends from very high-frequency 
(short-wavelength) y-rays and x-rays to low-frequency 
(long-wavelength) radio signals (Fig. 4.59). Visible light 
constitutes a narrow part of the spectrum. Two ranges of 
electromagnetic radiation are of particular importance in 
solid Earth geophysics: a high-frequency range in the 
radar part of the spectrum, and a broad range of low 


frequencies extending from audio frequencies to signals 
with periods of hours, days or years. The electromagnetic 
equations reduce to simpler forms for these two particular 
frequency ranges. 

The left side of Eq. (4.108) describes the variation of 
the E-component of the electromagnetic wave in space. 
The right side describes its variation with time and so its 
frequency dependence. The first term is related to the 
familiar conduction of electricity in a conductor. Maxwell 
introduced the second term and called it the displacement 
current. It originates when charges are displaced but not 
separated from their atoms, causing an electric polariza¬ 
tion; fluctuations in the polarization have the effect of an 
alternating displacement current. Suppose that the electric 
dipole emitting E and B oscillates sinusoidally with 
angular frequency a>. Then | dEldt \ ~toE, and \B 2 E!dt 2 \~ 
co 2 E, so the magnitude ratio (MR) of the second (displace¬ 
ment) term to the first ( conduction ) term on the right side 
of Eq. (4.108) is 


MR = 


d 2 E 

A t O E 0 e r qj 2 


dE 


f^di\ 


B 0 B t CO 2 E s 0 s r 

crioE <r 


(4.109) 


where / is the frequency of the signal. The conductivity er 
of rocks and soils is generally in the range 10“ 5 to 10“' 
II 1 nr 1 ; in orebodies a may be as large as 10 3 to 10 5 
II 1 m 1 (see Fig. 4.41). Electromagnetic induction sur¬ 
veying is usually carried out at frequencies below 10 4 Hz, 
for which the magnitude ratio MR is much less than unity 
in both good and bad conductors. At these frequencies 
the electromagnetic signal passes through the ground in a 
diffusive manner, by conversion of the changing magnetic 
fields to electric currents and vice versa. High-frequency 
surveying employs radar signals with frequencies around 
10 8 Hz, for which the magnitude ratio MR is very small in 
an orebody but can be much greater than unity in rocks 
and soils. Under these conditions the electromagnetic 
signal propagates like a wave, and so is subject to 
diffraction, refraction and reflection. 

4.3.6.1 Electromagnetic induction 

Electromagnetic (EM) surveys carried out at frequencies 
below 50 kHz are based on the principle of electromagnetic 
induction. An alternating magnetic field in a coil or cable 
induces electric currents in a conductor. The conductivity 
of rocks and soils is too poor to permit significant induc¬ 
tion currents, but when a good conductor is present a 
system of eddy-currents is set up. In turn, the eddy currents 
produce secondary magnetic fields that are superposed on 
the primary field and can be measured at the ground 
surface (Fig. 4.60a). 

Suppose that a low-frequency plane wave propagates 
along the vertical z-axis. The displacement current is now 
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Fig. 4.60 (a) Illustration of primary and secondary fields in the 
horizontal loop induction method of electromagnetic exploration for 
shallow orebodies. (b) Amplitudes and phases of the primary (p) and 
secondary (s) fields. 


negligible compared to the conduction current and Eq. 
(4.107) becomes 


d 2 B 

dz 2 


dB 


W 7 at 


(4.110) 


where the magnetic field has components B x and B The 
form of this equation is reminiscent of the one-dimen¬ 
sional equation of diffusion or heat conduction (Eq. 
(4.52)), whose solution (Eq. (4.54)) describes how the 
temperature changes with time and position when a fluc¬ 
tuating temperature acts on the surface. By analogy, the 
solution of Eq. (4.110) for the components B x or B of an 
alternating magnetic field with angular frequency w 
(= 2tt /) in a conductor with conductivity a is 

B xy (z,t) = B 0 e ~ z/d cos ((ot -(4.111) 


where d = 


V 


2 

/JLqCTCx) 


(4.112) 


skin depth. Elowever, it decays to —1% at a depth z=5d 
and to —0.1% at z = ld, effectively limiting the practical 
depth of exploration with the induction method. 

The many field methods of EM induction have a 
common principle. A coil or cable is used as transmitter 
of the primary alternating magnetic field, while another 
coil serves as receiver of both the primary signal and a 
secondary signal from the eddy currents induced in a con¬ 
ductor (Fig. 4.60a). The magnetic field in the conductor 
experiences a phase shift (equal to z!d, Eq. (4.111)) due to 
the conductivity. This results in a phase difference 0 
between the secondary and primary signals in the receiver 
(Fig. 4.60b). The exact theory of EM induction is compli¬ 
cated, even in a simple situation, but we can obtain a 
simple qualitative appreciation by applying some con¬ 
cepts from electrical circuit theory. As in Section 4.2.6.1 
we will use complex numbers involving i = V-l. 

Let the current systems in transmitter, receiver and 
conductor be represented by simple loops carrying cur¬ 
rents 7 t , I v and 7 c , respectively. If the currents are sinu¬ 
soidal, each has the form 7=7 0 e lwt , so that d7/dr = io>7. Let 
the resistance of the conductor be R and its self-induc¬ 
tance be L. The voltage V c in the conductor is composed 
of two parts. A resistive part due to the current 7 c in the 
resistance R is equal to Iff. An inductive part due to the 
change of current is equal to LdIJdt. The complete 
voltage in the conductor is then 

F C = 7 C 7? + L^ = 7 C (7? + . W L) (4.113) 

The voltage V c is induced in the conductor by the 
changing current 7 t in the transmitter circuit. Let the 
mutual inductance between transmitter and conductor be 
M tc ; then V c = - M tc dljdt. Similarly, the transmitter 
current induces a primary voltage V =- M tr d7 ( /df in the 
receiver, in which the eddy currents in the conductor also 
induce a secondary voltage V s = - M ci dljdt. Here M tr 
and M cr are the mutual inductances between transmitter 
and receiver, and conductor and receiver, respectively. 
The following relationships exist between the different 
voltages and currents: 

V P = ~ M "di = ( 4 - 114 ) 


Here, d is called the skin depth. At this depth the magnetic 
field is attenuated to e~* (—37%) of its value outside the 
conductor. The skin depth is dependent on the conductiv¬ 
ity of the body and the frequency of the field. The skin 
depth in normal ground (or— 1(T 3 II -1 nr 1 ) for a low- 
frequency alternating magnetic field (/ —10 3 Hz) is 
about 500m but in an orebody (a— 10 4 fi 1 nr 1 ) it is 
only —16 cm. The comparable figures for a high-fre¬ 
quency radar signal (/ —10 9 Hz) are 50 cm and 0.16 mm, 
respectively. Note that the skin depth is not the maximum 
depth of penetration of the magnetic field. It helps to 
indicate how rapidly the field is attenuated, but the mag¬ 
netic field is effective at depths that are many times the 


K = “ M crd7 = “i" M c/c (4-115) 

c/7. 

V C = - M t0fft = - iwM tc 7 t (4.116) 


Combining Eq. (4.113) and Eq. (4.116) we get 


7 - i u>M - i (oM t 

7 t = r + mL = ( r 2 + co 2 l 2 ) (R ~ lcoL) 


(4.117) 


From Eq. (4.114) and Eq. (4.115) the ratio of V s to V p in 
the receiver is 

K = K7c = 47 tc M cr ((o 2 L + ifc>R) 

V P (R 2 + &> 2 L 2 ) 


(4.118) 
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which can be written in the form 

K MJc M tM/3 2 + i(3\ , |Q) 

F p M tr / t M tr L{ l+/ 3 2 J 

where (3 = coLIR is the response parameter of the conduc¬ 
tor. The function in parentheses in Eq. (4.119) is a 
complex number, so the voltage ratio (or response of the 
measuring system) can be written P+iQ. The real part P 
has the same phase as the primary signal and is called the 
in-phase component of the response. The imaginary part 
Q is 90° out of phase with the primary signal (i.e., if the 
primary signal is —cos cor, the imaginary part is —sin w t = 
cos [tof-TT/2]); it is called the quadrature component. 

43.6.2 EM induction surveying 

The resemblance of the basic equations of EM induction 
to the diffusion equation (Eq. (4.53)) classifies the method 
as a diffusive one. Diffusive techniques - for example, the 
gravity, magnetic field, geothermal and seismic surface- 
wave methods - respond to a volumetric average of the 
specific physical parameter and do not show fine detail of 
its distribution. Hence, EM induction yields an average 
value of the electrical conductivity in a particular volume, 
but the resolution is better than that of potential field 
methods. 

The EM induction method is very suitable for airborne 
surveys. These were carried out originally with fixed wing 
aircraft but they now more commonly use helicopters, 
which can adapt better to terrain roughness while flying 
close to the ground. The transmitter and receiver coils are 
mounted (usually with their axes coaxial or parallel to the 
line of flight) in fixed positions in the aircraft or in a towed 
“bird,” using configurations similar to those of airborne 
magnetometer surveys (see Fig. 5.44). Alternatively, the 
transmitter may be in the airplane and the receiver in the 
bird or in another airplane. The increased separation of 
transmitter and receiver in this configuration gives greater 
depth penetration. However, in flight the bird yaws and 
pitches, altering the separation and parallelism of the 
coils, so that normally only the quadrature component is 
usable. The flight patterns consist of parallel profiles tra¬ 
versing the terrain. Lines are flown at about 100 m above 
ground level with fixed-wing aircraft and 30 m with heli¬ 
copters. 

When a potential conductivity anomaly has been 
located from the air, it is usual to investigate it further 
with a ground-based EM induction method. The trans¬ 
mitter may be a long cable or large horizontal loop on 
the ground surface, or it may be a small coil (diameter 
— 1 m) with its axis vertical or horizontal. The receiver is 
usually a similar small coil. It can be used to detect the 
direction, intensity or phase of the secondary signal. In 
its most simple application the tilt of the coil about a 
horizontal axis measures the dip-angle of the combined 
primary and secondary fields at the receiver. The 



Fig. 4.61 (a) Geometry of an HLEM profile across a thin vertical dike. 

(b) In-phase and quadrature profiles over a dike at depth h/l =0.2 for 
some values of the dimensionless response parameter a. 

method allows location, outlining and, to some extent, 
depth determination of a conductor. However, the dip- 
angle method registers only part of the available infor¬ 
mation in the secondary signal and does not describe the 
electrical properties of the conductor. As shown by Eq. 
(4.119) these properties affect the phase and relative 
amplitudes of the in-phase and quadrature components 
of the secondary signal relative to the primary. Phase- 
component EM measurement methods, therefore, allow 
more detailed interpretation. 

The methods are illustrated by the horizontal loop elec¬ 
tromagnetic method (HLEM), popularly known also by 
the commercial names Slingram or Ronka. The receiver 
and transmitter are coupled by a fixed cable about 30 to 
100 m in length, and kept at a constant separation while the 
pair is moved along a traverse of a suspected conductor 
(Fig. 4.61a). The cable supplies a direct signal that exactly 
cancels the primary signal at the receiver, leaving only the 
secondary field of the conductor. This is separated into in- 
phase and quadrature components, which are expressed as 
percentages of the primary field and plotted against the 
position of the mid-point of the pair of coils (Fig. 4.61b). 
The in-phase and quadrature signals are zero far from the 
conductor and at the places where either the transmitter or 
receiver passes over the conductor. This enables the outline 
of a buried conductor to be charted. The signal rises to a 
positive peak on either side and falls to a negative peak 
over the middle of the conductor. The peak-to-peak 
responses of the in-phase and quadrature components 
depend on the quality of the conductor, which is expressed 
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by a response parameter such as /3 in Eq. (4.119). A suit¬ 
able function is the dimensionless parameter a = fi Q crco si, 
which contains the conductivity cr and width s of the con¬ 
ductor as well as the coil spacing I and frequency co of the 
EM system. The systematic variation of the response 
curves with the value of a (Fig. 4.61b) allows interpreta¬ 
tion of the quality of the conductor. A simple way of 
doing this is with the aid of model response curves. The 
variation of in-phase and quadrature signals over a con¬ 
ducting orebody can be modelled experimentally on a 
smaller scale in the laboratory. The smaller values of s and 
/ are compensated by larger values of a and co to give the 
same response parameter a. The model response curves for 
different a are then directly applicable to the interpretation 
of real conductors measured in the field. 

The most common use of EM induction methods is in 
lateral profiling, usually on traverses at right angles to the 
geological strike of dikes or other suspected conducting 
bodies. In environmental applications it is useful for 
locating buried pipes that may carry fluids or gases. 
Ground-based EM methods may also be used for vertical 
sounding, applying the same principles as in resistivity 
methods to obtain the conductivities of horizontal 
layers. The greater the separation of transmitter and 
receiver, the deeper is the maximum depth at which con¬ 
ductors may be analyzed. An important form of vertical 
EM sounding is the magnetotelluric method, which takes 
advantage of the penetrative ability of low-frequency 
signals from natural sources in the external geomagnetic 
field. 


be along the x-axis, so that the magnetic field B (being 
normal to E x ) is along the y-axis. Ampere's law, as summa¬ 
rized in Maxwell’s equations, relates E x to the gradient of 
B y in the r-direction. Because B has only a y-component, 
Ampere’s law simplifies to 


E = - 


1 


SB .. 


dz 


(4.120) 


where B y has the form of Eq. (4.111). Differentiating B y 
by parts gives 
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Comparison of Eq. (4.111) and Eq. (4.121) shows a 
phase shift of 45° (ir/4) between E x and B y . However, the 
ratio of the maximum amplitudes of the two compo¬ 
nents is 


\E,\ <2 

\B y \ M 0 o-d 


(4.122) 


43.6.3 Magnetotelluric sounding 

Magnetotelluric (MT) sounding is a natural-source elec¬ 
tromagnetic method. The fluctuating electromagnetic 
fields that originate in the ionosphere are partly reflected 
at the Earth’s surface; the returning fields are again 
reflected off the conducting ionosphere. This happens 
repeatedly, so that the fields eventually have a strong verti¬ 
cal component and may be regarded as vertically propa¬ 
gating plane waves with a wide spectrum of frequencies. 
These fields penetrate into the ground and induce telluric 
electric currents (Section 4.3.4.3), which in turn generate 
secondary magnetic fields. The telluric currents are 
detected with two pairs of electrodes, usually oriented 
north-south and east-west. Three components of the 
magnetic fields are measured: the vertical component and 
a horizontal component parallel to each of the telluric 
components. The method yields conductivity information 
from much greater depths than artificial-source induction 
methods. It has been applied in the search for petroleum 
and deep zones of mineralization in the upper crust. 
Utilizing long periods in the range 10-1000 s it is an 
important method for the investigation of the structure of 
the crust and upper mantle. 

Consider a plane electromagnetic wave propagating in 
the r-direction (Fig. 4.58). Let the electric component E x 


If we now substitute for d from Eq. (4.112) and write 


p= 1/er, we get 


/fiAl 2 

9 co\B y P 

(4.123) 

d= 2 W 

v\B y I 

(4.124) 


for the effective resistivity p at depth d. The analysis gives 
similar results for an electric field along the y-axis and a 
corresponding magnetic field along the .v-axis. In this case 
the ratio of the field amplitudes is |£7 > ,|/|Z? x .|. 

In addition to the horizontal magnetic fields, B x and 
B. the vertical component B_ is also recorded for use 
in the interpretation of two-dimensional structures. 
Thus the data set from an MT site consists of two electri¬ 
cal components and three magnetic field components 
recorded continuously during a lengthy observation 
interval covering some hours or days. The recorded mag¬ 
netic fields consist of an external part from the ionos¬ 
phere and an internal part related to the induced current 
distribution. These components must be separated ana¬ 
lytically. The electric and magnetic records contain 
numerous frequencies, some of which are simply noise 
and some are of geophysical interest. As a result, sophis- 
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Fig. 4.62 Two-dimensional 
resistivity model of the crust 
and upper mantle beneath 
Vancouver Island and the 
adjacent mainland derived 
from magnetotelluric results 
(redrawn from Kurtz etal., 
1986). 


Georgia . , , 

r. ., mainland 

Pacific Ocean -[-Vancouver Island -\ -1- British 



ticated data-processing is required, involving power- 
spectrum analysis and filtering. 

The interpretation of MT data is based on either mod¬ 
elling or inversion. The modelling method is a direct 
approach to solving the conductivity distribution. It 
assumes a conductivity model for which a theoretical 
response is calculated and compared with the real 
response. The parameters of the model are adjusted in 
turn repeatedly to obtain the most favorable fit to the 
observations. As in the case of vertical electrical sounding 
with direct currents (Section 4.3.5.6), the inversion 
method seeks a solution to the EM induction problem by 
using the frequency spectrum of the observations to 
establish the causal conductivity distribution. 

Although MT sounding can be carried out in the sub¬ 
audio to audio range (/ — 10-10 4 Hz), its main applica¬ 
tion is in determining the electrical conductivity at great 
depths using very low frequencies (J< 1 Hz). The investi¬ 
gation of resistivity in the crust and upper mantle using 
MT sounding is illustrated by a profile across Vancouver 
Island (Fig. 4.62). Twenty-seven MT sounding stations 
were located along a NW-SE reflection seismic profile. 
One-dimensional analysis of the vertical distribution of 
resistivity beneath three stations (10, 12 and 14 in Fig. 
4.62) showed an electrical discontinuity at virtually the 
same depth as a major seismic reflector observed in the 
associated seismic reflection profile. The information 
acquired about the mean resistivity above this depth was 
then used in a two-dimensional inversion of the MT 
records at all the stations. The resistivity pattern was 
interpreted down to depths of 100 km. It shows a north¬ 
eastward dipping zone of low resistivity (p®=30 II m), 
referred to as the E-conductor, surrounded by much more 
resistive material (p>5000 O m). The E-conductor was 
interpreted as the top of the descending Juan de Fuca 
plate, where it subducts under the North American plate. 
The anomalously high conductivity in the top of the plate 
was attributed to conducting fluids in sediments derived 
from the accretionary wedge. 


43.6.4 Ground-penetrating radar 

At high frequencies in poorly conducting media the 
conduction term in the electromagnetic equations is 
negligible compared to the displacement term. The elec¬ 
tric field equation then becomes 

V 2 E = » 0 ef t f (4.125) 


with a similar equation for the magnetic field. This has 
the familiar form of the wave equation, which describes 
the propagation of an elastic disturbance (Eqs. (3.55) 
and (3.56)). Analogously, Eq. (4.125) describes the prop¬ 
agation of the electric part of an electromagnetic wave. 
By comparing with the seismic wave equations we see 
that the E and B fields in an electromagnetic wave have 
the same velocity v, where v 2 = l/p, 0 e. In a vacuum the 
wave velocity is equal to the velocity of light c, given by 
c 2 = 1 lp- 0 s. Using the relationship e = ks 0 we get 
v 2 = c 2 Ik, and taking into account that the dielectric con¬ 
stant k is ~ 5-20 in Earth materials, the velocity of an 
electromagnetic wave in the ground is found to be about 
0.2c-0.6c. 

To simplify further discussion suppose that the elec¬ 
tromagnetic disturbance propagates along the z-axis (i.e., 
dldx = dldy — 0), so that V 2 = d 2 /dr 2 . For this one-dimen- 
sional case 


d 2 E_ d 2 E_ 1 d 2 E 
dz 2 B dt 2 v 2 dt 2 


(4.126) 


If we compare this equation with Eq. (3.58) for a seismic 
wave, we see that the solution for a component £j of the 
electric field is 

E,= E () s\n2u(^~ft'j (4.127) 


where A is the wavelength, / the frequency and /A = v, the 
velocity of the wave. 
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Fig. 4.63 Fracture pattern in 
a granitic bedrock revealed by 
ground-penetrating radar: (a) 
geological cross-section, (b) 
processed georadar reflection 
section (courtesy of A. G. 
Green). 


(a) 




The above considerations suggest that high-frequency 
electromagnetic waves travel in the ground in an analo¬ 
gous manner to seismic waves. Instead of being deter¬ 
mined by the elastic parameters the propagation of radar 
signals is dependent on the dielectric properties of the 
ground. A comparatively young branch of geophysical 
exploration has been developed to investigate under¬ 
ground structures with ground-penetrating radar (GPR, 
or georadar). GPR makes use of the familiar “echo- 
principle” used in reflection seismology. A very short 
radar pulse, lasting only several nanoseconds (i.e., 
~10“ 8 s) is emitted by a mobile antenna on the ground 
surface. The path of the radar signal through the ground 
can be traced as a ray, which experiences refractions, 
reflections and diffractions at boundaries where the dielec¬ 
tric constant changes. A second antenna, the receiver, is 
located close to the transmitter, as in the case of seismic 
reflection, so as to receive near-vertical reflections from 
underground discontinuities. The signal-processing tech¬ 
niques of reflection seismology can also be applied to the 
georadar signal to help minimize the effects of diffractions 
and other noise. Consequently, georadar provides a 
detailed picture of the shallow subsurface structure (Fig. 
4.63). It has become an important tool in environmental 
studies of near-surface features, such as buried and forgot¬ 
ten waste deposits, fracture patterns in otherwise uniform 
rock bodies, or the investigation of groundwater resources. 


High-frequency signals are rapidly attenuated with 
depth. Geometrical spreading of the signal outward from 
its source (spherical divergence) causes a decrease in 
intensity with distance. More important is absorption of 
the signal by ground materials, which is a function of 
their conductivity. Depending on the composition of the 
soil or rocks (e.g., the presence of clay-rich layers or 
groundwater), the nature of subsurface structures and the 
frequency of the radar signal, the effective penetration 
may be up to 10 m, although conditions commonly 
restrict it to only a few meters. However, at a radar fre¬ 
quency of 10 8 — 10 9 Hz and with a velocity of —10 s m s -1 
the resolution is in the range 0.1-1 m. Thus, despite its 
limited depth penetration, the high resolution of geo¬ 
radar makes it a powerful tool for near-surface geophysi¬ 
cal exploration. 

4.3.7 Electrical conductivity in the Earth 

The complicated structure of the crust and upper mantle 
results in large lateral variations in electrical conductivity. 
Apart from the oceans, sediments and individual anom¬ 
alous conductors, the outer carapace of the Earth is gener¬ 
ally a poor electrical conductor. The physical mechanism 
of conductivity in silicate rocks is by semiconduction, 
which can take place in three different ways (Section 
4.3.2.3). Each type of semiconduction is governed by a 
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thermally activated process, in which the conductivity a at 
temperature T is given by 

cr = a 0 Q~ E tJ kT (4.128) 

where k is Boltzmann’s constant (k= 1.38065X 10 -23 
J K -1 ). The constant a Q is the hypothetical maximum 
value of the conductivity, reached asymptotically at very 
high temperatures. E & is the activation energy of the par¬ 
ticular type of semiconduction. Its value determines the 
temperature range in which the thermally activated 
process becomes effective as a mechanism for cr. In the 
crust and upper mantle (i.e., in the lithosphere) impurity 
semiconduction is likely the main mechanism in dry 
rocks. Electronic semiconduction is probably dominant in 
the asthenosphere and deeper regions of the mantle. Ionic 
semiconduction is an important mechanism at high tem¬ 
perature, but is unlikely to be significant below about 
400 km, because it is suppressed by the high pressure at 
greater depth. 

The electrical conductivity in the Earth at great depths 
is inferred from four sources: deep electrical sounding, 
geomagnetic variations, secular variations and extrapola¬ 
tion from laboratory experiments. The first two methods 
are based on induction effects arising from changes in the 
external part of the geomagnetic field; these encompass a 
broad spectrum with peaks of energy at several periods 
(see Fig. 4.45). Electrical and magnetotelluric sounding 
use the components with periods from milliseconds to 
one or two days. The inversion of MT data gives a 
conductivity pattern that is generally concordant with 
seismic data and related to the broad geological structure 
of the crust and upper mantle. 

The time spectrum of external geomagnetic field varia¬ 
tions contains some prominent periods that are longer 
than a day (Fig. 4.45). The study of the longer-period 
geomagnetic variations provides information about con¬ 
ductivity in the Earth down to about 2000 km. The longer 
the period of the variation the deeper its penetration 
depth. The daily (or diurnal) variation (Section 5.4.3.3) 
yields conductivity information to about 900 km. 
Magnetic storms last several days or weeks and have a 
strong 48 hr component, which is used to extend conduc¬ 
tivity information to about 1000 km. In addition to the 
spectrum of geomagnetic variation shown in Fig. 4.45 
there is a longer-period component related to the 11-yr 
sunspot cycle. This results from increased solar activity 
and is accompanied by solar flares (see Section 5.4.7.1) 
and emissions of charged particles that augment the solar 
wind and excite ionospheric activity. Analysis of the 11-yr 
component allows the model of mantle conductivity to be 
extended to about 2000 km depth. Our knowledge of the 
electrical conductivity in the mantle at depths greater 
than 2000 km cannot be obtained from analysis of effects 
related to the external magnetic field. 

The secular variation of the internal geomagnetic field 
(Section 5.4.5) originates in the upper part of the fluid 


outer core. It consists of fluctuations in intensity and 
direction with periods of the order of 10-10 4 yr. If the 
secular variation could be observed at the core-mantle 
boundary it would be possible to determine conductivity 
throughout the mantle. Unfortunately, secular variation 
must be observed at the Earth's surface, after it has passed 
through the conducting mantle, which acts as a filter. The 
signal is attenuated by the skin effect, preferentially 
affecting the highest frequencies. Thus, observations of 
high-frequency changes in secular variation place an 
upper limit on the average conductivity of the mantle, 
because a greater conductivity would block them out. 
From time to time, abrupt changes in the rate of secular 
variation take place for unknown reasons. A conspicuous 
example of these “geomagnetic jerks” occurred in 
1969-1970, when a pulse in secular variation occurred 
with an estimated duration of less than two years. 
Although not all analysts concur, the effect is widely 
believed to be of internal origin. Analysis of the propaga¬ 
tion of a secular-variation pulse provides an estimate of 
the mean conductivity of the whole mantle, which, inte¬ 
grated with data from other sources, gives the conductiv¬ 
ity in the lower mantle. 

Some of the different models of mantle conductivity 
that have been proposed are shown in Fig. 4.64. The 
differences between the models reflect increases in quan¬ 
tity and improvements in quality of geomagnetic data as 
well as advances in the techniques of data-processing, 
especially the development of inversion methods. 
Although the models diverge in many respects they have 
some features in common. The conductivity averages 
about 10 -2 II -1 m” 1 in the lithosphere and increases with 
increasing depth. Sharper rates of increase are found at 
depths of 400 and 670 km, where the olivine-spinel and 
spinel-perovskite phase changes occur, respectively (see 
Section 3.7.5.2). At about 700 km depth each model gives 
a conductivity of about 1 II -1 m~' which rises in the 
lower mantle to be about 10-200 II -1 m" 1 at the 
core-mantle boundary. The secular variation is not 
uniform over the Earth’s surface. Large areas of conti¬ 
nental size (the Central Pacific is the best studied) are 
characterized by slow rates of variation. This is possibly 
due to the additional screening effect of features in the 
Delayer above the core-mantle boundary (Section 
3.7.5.3), so-called “crypto-continents” (see Fig. 4.38) in 
which the conductivity may be 1000 times higher than in 
the overlying mantle (Stacey, 1992). 

Conductivity in the outer core is estimated by extrapo¬ 
lation from laboratory experiments. The core has the 
composition of an iron alloy, with an iron content of 
—83% and a concentration of the alloying elements of 
— 17%. The effect of pressure on the conductivity of the 
alloy is not large at this concentration. Measurements of 
resistivity at atmospheric pressure and different tempera¬ 
tures lead to an extrapolated resistivity of p = 3.3 X 10 -6 
II m at the temperature of the outer core, with a corre¬ 
sponding conductivity cr = 3 X 10 -5 II -1 m _1 . 
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Fig. 4.64 Models of electrical 
conductivity (a) at depth in 
the mantle proposed by (a) 
MacDonald (1957; M57)and 
Banks (1969; B69), (b) 
Achache etal. (1981; A 81) 
and Stacey (1992; S 92). 


(a) (b) 



4.4 SUGGESTIONS FOR FURTHER READING 

Introductory level 

Mussett, A. E. and Khan, M. A. 2000. Looking into the Earth: 
An Introduction to Geological Geophysics, Cambridge: 
Cambridge University Press. 

Parasnis, D. S. 1997. Principles of Applied Geophysics, 5th edn, 
London: Chapman and Hall. 

Sharma, P. V. 1997. Environmental and Engineering Geophysics, 
Cambridge: Cambridge University Press. 

Intermediate level 

Dobrin, M. B. and Savit, C. H. 1988. Introduction to Geophysical 
Prospecting, 4th edn, New York: McGraw-Hill. 

Faure, G. and Mensing, T. M. 2005. Isotopes: Principles and 
Applications, Hoboken, NJ: Wiley. 

Fowler, C. M. R. 2004. The Solid Earth: An Introduction to 
Global Geophysics, 2nd edn, Cambridge: Cambridge 
University Press. 

Telford, W. M„ Geldart, L. P. and Sheriff, R. E. 1990. Applied 
Geophysics, Cambridge: Cambridge University Press. 
Turcotte, D. L. and Schubert, G. 2002. Geodynamics, 2nd edn, 
Cambridge: Cambridge University Press. 

Advanced level 

Cathles, L. M. 1975. The Viscosity of the Earth’s Mantle, 
Princeton, NJ: Princeton University Press. 

Dalrymple, G. B. 1991. The Age of the Earth, Stanford, CA: 
Stanford University Press. 

Davies, G. F. 1999. Dynamic Earth: Plates, Plumes and Mantle 
Convection, Cambridge: Cambridge University Press. 

Dickin, A. P. 2005. Radiogenic Isotope Geology, 2nd edn, 
Cambridge: Cambridge University Press. 


Grant, F. S. and West, G. F. 1965. Interpretation Theory in 
Applied Geophysics, New York: McGraw-Hill. 

Jessop, A. M. 1990. Thermal Geophysics, Amsterdam: 
Elsevier. 

Peltier, W. R. (ed) 1989. Mantle Convection: Plate Tectonics and 
Global Dynamics, New York: Gordon and Breach. 

Ranalli, G. 1995. Rheology of the Earth: 2nd edn, London: 
Chapman and Hall. 

Schubert, G., Turcotte, D. L. and Olson, P. 2001. Mantle 
Convection in the Earth and Planets, Cambridge: Cambridge 
University Press. 

Stacey, F. D. 1992. Physics of the Earth, 3rd edn, Brisbane: 
Brookfield Press. 

York, D. and Farquhar, R. M. 1972. The Earth’s Age and 
Geochronology, Oxford: Pergamon Press. 


4.5 REVIEW QUESTIONS 

1. Define the following age-dating parameters: (a) decay 
constant, (b) half-life, (c) isochron. 

2. The radioactive carbon method of age dating is a 
simple decay analysis. Explain what this statement 
means. Describe the principle of the method. 

3. Describe the principle of a mass spectrometer. What 
is the Lorentz force? 

4. What aspects make the 40 K/ 40 Ar method suitable for 
determining the ages of rocks? What advantages 
does 40 Ar/ 39 Ar dating have over the 40 K/ 40 Ar 
method? 

5. What types of materials are suitable for dating with 
the radioactive carbon method? For what range of 
ages may it be applied? What are possible problems 
with the method? 
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6. Where are the oldest regions of the oceans? Where 
are the oldest continental regions? Compare the ages 
of the oldest oceanic and continental regions and 
account for the difference. 

7. Explain the uranium-lead dating method. What is 
the concordia curve? What is the discordia line? Why 
is the U-Pb method suitable for dating very old mate¬ 
rials, such as Precambrian rocks? 

8. Why are zircons important for dating very old rocks? 
How do the ages of the oldest rocks on Earth 
compare with the ages of meteorites and the Moon? 

9. What is meant by temperature? What is meant by 
heat? 

10. What are the processes by which heat can be trans¬ 
ferred? What is the relative importance of each 
process in (a) the crust, (b) the mantle, (c) the outer 
core, and (d) the inner core? 

11. Sketch how (a) temperature and (b) the melting point 
(solidus) vary with depth in the Earth's interior. 

12. How is heat flow defined? How is it measured (a) on 
the continents and (b) in the oceans? 

13. What factors determine the depth of penetration of 
solar energy into the Earth? What precautions does 
this impose for measuring heat flow? 

14. Why is the average oceanic heat flow higher than the 
average continental heat flow? 

15. How does heat flow vary with distance from an 
oceanic ridge? 

16. Which regions of the Earth have (a) the highest and 
(b) the lowest heat flow? 

17. Discuss the statement: “The internal heat of the 
Earth causes the formation of mountains and the 
external heat of the Sun causes their destruction.” 

18. Which characteristics of the ground determine its 
electrical resistivity? 

19. Explain why geoelectrical resistivity measurements 
yield only an apparent resistivity. 

20. What are telluric currents? How do they originate? 

21. What is meant by the skin depth for the propagation 
of electromagnetic waves? 

22. What are the in-phase and quadrature components in 
an electromagnetic induction survey? What causes 
the phase shift? Which component responds more 
strongly to the presence of a good conductor? 

23. What is the magnetotelluric method of electromag¬ 
netic surveying? What are the merits of this method 
for deep Earth sounding? 

24. What is ground-penetrating radar and in which part 
of the electromagnetic spectrum is it operative? Why 
can GPR signals be processed analogously to 
reflected and refracted seismic waves? 

25. Why is GPR a powerful method for exploring 
shallow subsurface structure? Which properties of 
the ground determine the effectiveness of the method 
and limit its depth range? 


4.6 EXERCISES 

Geochronology 

1. How many half-lives must elapse before the activity 
of a radioactive isotope decreases to 1% of its initial 
value? How long is this time for 14 C, which has a 
decay rate of 1.21 X 10 -4 yr _1 ? 

2. Radiocarbon dating of a sample of wood from the 
tomb of an Egyptian pharaoh gave isotopic concen¬ 
trations of 9.843 X 1(T 15 mol g -1 for 14 C and 1.202 X 
10 -2 mol g~* for 12 C. Assuming that the initial 
14 C/ 12 C ratio in the sample corresponded to the long¬ 
term atmospheric ratio of 1.20 XlO -12 , determine 
the age of the tomb, the percentage of 14 C remaining, 
and the original 14 C concentration in the wood. 

3. The decay constants of 235 U and 238 U are A 235 = 
9.8485XlO- 10 yr" 1 and A 238 = 1.55125 X 10“ 10 yr” 1 . 
Calculate the half-lives of these uranium isotopes. 

4. Assuming that the isotopes 235 U and 238 U were 
created in a common event, such as a supernova, and 
given that their abundances are now in the ratio 
235 U/ 238 U= 1/137.88, calculate how long ago they 
were created. 

5. The analysis of strontium and rubidium isotopes in 
whole rock samples from a granitic batholith gave the 
following atomic concentrations in p.p.m.: 


Sample 

87 Sr 

87 Rb 

86 Sr 

A 

2.304 

8.831 

2.751 

B 

0.518 

29.046 

0.450 

C 

1.619 

111.03 

1.232 

D 

1.244 

100.60 

0.871 


(a) Calculate the 87 Rb/ 86 Sr and 87 Sr/ 86 Sr isotopic 
ratios for these samples. 

(b) Determine the age of the batholith and the initial 
87 Sr/ 86 Sr ratio. 


Argon-argon dating of muscovite in a Late 
Cretaceous granite gave the following isotope ratios 
for the plateau stages during incremental heating: 

Maximum heating 



temperature [°C] 

39 Ar/ 36 Ar 

40 Ar/ 36 Ar 

750 

1852 

8855 

830 

1790 

8439 

895 

1439 

6867 

970 

3214 

15380 

1030 

2708 

12970 


(a) Calculate the 40 Ar/ 39 Ar ratios for each incremen¬ 
tal heating step. 
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(b) A calibration constant /= 0.00964 was deter¬ 
mined for the monitor mineral. Using the 
40 Ar/ 39 Ar ratios from part (a) in Eq. (4.18), calcu¬ 
late the apparent ages at each heating step. 

(c) Draw an 40 Ar/ 39 Ar isochron diagram by plotting 
each 40 Ar/ 36 Ar ratio as ordinate against the corre¬ 
sponding 39 Ar/ 36 Ar ratio as abscissa. Draw a 
best-fitting line - the isochron - through the data 
points, and determine its slope and intercept. 

(d) Compute the age of the muscovite from the slope 
of the isochron. Is the intercept on the ordinate 
axis significant? 

7. The following isotopic ratios were measured in U-Pb 
age determinations on three zircon grains extracted 
from a granite: 


Sample 

207 Pb/ 235 U 

206 Pb/ 238 U 

zircon 1 

27.4 

0.60 

zircon 2 

33.3 

0.68 

zircon 3 

37.9 

0.74 


(a) Using the values listed in Table 4.2, plot a Con¬ 
cordia diagram on graph paper or with a plotting 
routine. Enter the measurements from the above 
table on the graph, and draw the straight discor- 
dia line through the points. 

(b) Determine the coordinates of the intersection 
points of the concordia and discordia lines. 

(c) Using the coordinates of the upper intersection 
point together with Eq. (4.19) and Eq. (4.20), cal¬ 
culate the age of formation of the zircons. 

(d) Calculate when loss of lead occurred in the 
zircons. 

8. The following isotopic ratios were measured in a 
K-Ar age determination on an ignimbrite as part of 
a combined radiometric-paleomagnetic study of geo¬ 
magnetic polarity. 


Sample 

40 K/ 36 Ar 

40 Ar/ 36 Ar 

A 

4,716,000 

822 

B 

8,069,000 

1200 

C 

12,970,000 

1730 

D 

27,670,000 

3280 


(a) Plot the isotope ratios, draw the isochron, and 
compute its slope and intercept. 

(b) Calculate the isochron age of the ignimbrite. 

(c) Correct the observed 40 Ar/ 36 Ar ratios for the 
initial 40 Ar/ 36 Ar concentration, and compute the 
individual sample ages. 

(d) Calculate the mean age and its standard deviation. 
Compare the mean age with the isochron age. 


(e) With reference to the radiometric timescale in 
Fig. 5.74, what magnetic polarity would you 
expect the ignimbrite samples to have? 

The Earth's heat 

9. List and compare the various factors that may influ¬ 
ence the measured temperature gradient at a depth of 
5 m in (a) a deep drillhole in oceanic sediments and 
(b) a continental well that encounters the groundwa¬ 
ter table at 2 m depth. 

10. A shallow circular pond 100m in diameter freezes 
solid during a very cold night. The pond is in a geo¬ 
thermal area in which the temperature reaches 40 °C 
at 200 m depth. The thermal conductivity of the 
intervening rock is 3.75 W m -1 K -1 and the latent 
heat of fusion of ice is 334 kJ kg -1 . Neglecting other 
heat sources, calculate the mass of ice that melts per 
hour due to the geothermal gradient. 

11. Assuming a constant geothermal gradient of 30 °C 
per kilometer, estimate what percentage of the 
Earth's volume is hotter than the temperature of 
molten lava at atmospheric pressure. Why is the 
deeper interior of the Earth not entirely molten? 

12. The mean global heat flow at the Earth’s surface is 
87mW m -2 . Calculate the time in years needed for 
the mantle and core to cool by 100 °C, with the fol¬ 
lowing assumptions: (i) the Earth’s mantle and core 
cool as a homogeneous unit, (ii) 20% of the observed 
heat flow at the Earth’s surface is from the mantle, 
(iii) the lithospheric thickness is 100 km, (iv) thermal 
effects from the lithosphere itself may be ignored. 
Relevant properties of the mantle and core are: mean 
density 5650 kgm -3 , specific heat 400J kg -1 K -1 . 

13. A temperature gradient of 35 °C km -1 is measured in 
the upper few meters of sediments covering the ocean 
floor. If the mean thermal conductivity of oceanic 
sediments is 1.7 W m -1 K -1 , calculate the local heat 
flow. How far do you think the sampling site is from 
the nearest active ridge? 

14. What heat flow values would you expect at the loca¬ 
tions of the oceanic magnetic anomalies with 
numbers C5N, CION, C21N, C32N, M0? Interpret 
the ages of the anomalies from Fig. 5.78 and use the 
heat-flow model GDH1 (Eq. (4.62)) for the cooling of 
oceanic lithosphere. 

15. Using the relationships in Eq. (4.63), estimate the 
approximate depths of the ocean at these locations? 
What is the thickness of the elastic lithosphere and 
the depth of the top of the asthenosphere at these 
locations (see Fig. 2.79)? 

16. Assuming that the Earth initially had a uniform tem¬ 
perature throughout and has been cooling by con- 
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duction only, use the solution for the one-dimen- 
sional cooling of a semi-infinite half-space (Eq. 
(4.57) and Box 4.2) to derive Eq. (4.2) for Kelvin’s 
estimated age of the Earth. 

17. The temperature in the near-surface layers of the 
Earth’s crust varies cyclically with daily, annual and 
longer periods. For a surface temperature variation 
given by T=T 0 cos wt, the temperature variation at 
depth z and time t is described by: 


T(z,t) = r o exp( -|) cos (*,*-§) 



where r is the period of the variation, k is the 
thermal conductivity, c p is the specific heat, and p is 
the density. For surface sediments assume A: = 2.5 
Wm K -1 , c p = 10 3 J kg -1 K -1 , and p = 2300 kg m -3 . 

(a) Calculate the phase difference (in days) between 
the temperature variation at the surface and at 
depths of 2 m and 5 m, respectively. Perform the 
calculations for both the daily and annual tem¬ 
perature fluctuations. 

(b) Assuming that the range in surface temperatures 
between summer and winter is 40 °C, calculate 
the depth at which the annual temperature range 
is 5 °C. How large (in weeks and days) is the 
phase difference between the surface temperature 
and the actual temperature at this depth? 

18. The average daily temperature in northern Canada is 
+10 °C in July and —20 °C in January. Using the heat 
conduction equation calculate the thickness of 
the permafrost layer (below which the ground is per¬ 
manently frozen). Relevant physical properties of the 
ground are: thermal conductivity k = 3 W m K _1 , spe¬ 
cific heat c p = 840 J kg -1 K _1 , density p = 2700 kgm -3 . 

19. The half-spreading rate at an oceanic ridge in the 
middle of a symmetric ocean basin bounded by sub- 
duction zones is 44 mm yr -1 . The ridge is 1000 km 
long and the distance from the ridge to each subduc- 
tion zone is 2000 km. If the oceanic heat flow varies 
with crustal age as in Eq. (4.62), calculate how much 
heat is lost per year from the ocean basin. 

Geoelectricity 

20. At the interface between two layers with electrical 
resistivities p x and p ? , as in the figure below, the elec¬ 
trical boundary conditions are: (i) the component of 
current density J, normal to the interface is con¬ 
tinuous, and (ii) the component of electric field E x 
tangential to the interface is continuous. A current 
flow-line makes angles 6 X and 0 2 before and after 
refraction, respectively. 


z 



Derive the electrical “law of refraction” given by 
Eq. (4.102): 

tandj p 2 
tand 2 — Pi 

21. What is the effective resistivity of a slab of thickness 
L composed of two half-slabs each of thickness Z./2 
and with resistivities (2p) and (p/2), respectively, as in 
the diagram? 


0 Lil L 


-P 


pl 2 


22. Sea-water is contaminating an aquifer that is the 
source of drinking water for a seaside town. The fol¬ 
lowing measurements of apparent resistivity (p a ) 
were made at various electrode separations (a) with 
the expanding-spread Wenner method to investigate 
the leak. 


a 

[m] 

Pa 

[H m] 

a 

[m] 

Pa 

[11m] 

a 

[m] 

Pa 

[A m] 

10 

29.0 

140 

19.8 

280 

8.7 

20 

28.9 

160 

18.0 

300 

7.8 

40 

28.5 

180 

16.3 

320 

7.1 

60 

27.1 

200 

14.5 

340 

6.7 

80 

25.3 

220 

12.9 

360 

6.5 

100 

23.5 

240 

11.3 

400 

6.4 

120 

21.7 

260 

9.9 

440 

6.4 
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(a) Estimate the electrical resistivity of each layer. 

(b) Divide the apparent resistivity at each position by 
the resistivity of the upper layer, then plot the 
normalized resistivity against electrode separa¬ 
tion on a log-log diagram on the same scale as 
the model curves in Fig. 4.53. 

(c) Match the measured curve with the model curves 
and estimate the depth to the interface. 

23. In the Schlumberger resistivity method the separa¬ 
tion of the current electrodes L is much larger than 
the separation a of the voltage electrodes. Suppose 
that the mid-point of the voltage pair is displaced by 
a distance x from the mid-point of the current elec¬ 
trode pair. Show that, for (L - 2x) > a, the apparent 
resistivity is given by 

_ tt y (L 2 — 4.v 2 ) 2 
Pa ~ 4 7 a (L 2 + Ax 2 ) 

24. In the double-dipole resistivity method it is common 
to keep the separation of the pairs L an integer 
multiple n of the distance a between the electrodes in 
each pair, i.e. L = na. 

(a) Rewrite the formula for the apparent resistivity 
with this assumption. 


(b) If L is very large compared to a. modify the 
formula to show that the apparent resistivity is 
proportional to n 3 . 

25. Consider a double-dipole configuration in which the 
electrode pairs are not collinear but are broadside to 
each other (i.e., normal to the line joining them). The 
electrode separation is a and the distance between the 
mid-points of the pairs is L = na. Show that, for large 
values of «, the apparent resistivity is given in this 
case by 

P a = 2 tt rf'arj 

26. Calculate the velocity of an electromagnetic wave in 
(a) basalt (dielectric constant k = 12) and (b) water 
(k = 80.4). 

27. Calculate the “skin depths” of penetration in (a) 
granite (p = 5,000 II m) and (b) a pyrrhotite orebody 
(p = 5 X 10~ 5 II m) for electromagnetic waves in surveys 
employing (i) electromagnetic induction (/ = 1 kHz) 
and (ii) ground penetrating radar (/=100MHz). 
Would these methods detect the conducting bodies if 
they were buried under a water-saturated soil layer, 3 m 
thick with resistivity 100II m? 
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5.1 HISTORICAL INTRODUCTION 

5.1.1 The discovery of magnetism 

Mankind’s interest in magnetism began as a fascination 
with the curious attractive properties of the mineral lode- 
stone, a naturally occurring form of magnetite. Called 
loadstone in early usage, the name derives from the old 
English word load, meaning “way” or “course”; the load¬ 
stone was literally a stone which showed a traveller the 
way. 

The earliest observations of magnetism were made 
before accurate records of discoveries were kept, so that 
it is impossible to be sure of historical precedents. 
Nevertheless, Greek philosophers wrote about lodestone 
around 800 BC and its properties were known to the 
Chinese by 300 BC. To the ancient Greeks science was 
equated with knowledge, and was considered an element 
of philosophy. As a result, the attractive forces of lode- 
stone were ascribed to metaphysical powers. Some early 
animistic philosophers even believed lodestone to possess 
a soul. Contemporary mechanistic schools of thought 
were equally superstitious and gave rise to false concep¬ 
tions that persisted for centuries. Foremost among these 
was the view that electrical and magnetic forces were 
related to invisible fluids. This view persisted well into the 
nineteenth century. The power of a magnet seemed to 
flow from one pole to the other along lines of induction 
that could be made visible by sprinkling iron filings on a 
paper held over the magnet. The term “flux” (synony¬ 
mous with flow) is still found in “magnetic flux density,” 
which is regularly used as an alternative to “magnetic 
induction” for the fundamental magnetic field vector B. 

One of the greatest and wealthiest of the ancient 
Greek city-colonies in Asia Minor was the seaport of 
Ephesus, at the mouth of the river Meander (modern 
Kuquk Menderes) in the Persian province of Caria, in 
what is now the Turkish province of western Anatolia. In 
the fifth century BC the Greek state of Thessaly founded 
a colony on the Meander close to Ephesus called 
Magnesia, which after 133 BC was incorporated into the 
Roman empire as Magnesia ad Maeandrum. In the vicin¬ 
ity of Magnesia the Greeks found a ready supply of lode¬ 
stone, pieces of which subsequently became known by the 
Latin word magneto from which the term magnetism 
derives. 


It is not known when the directive power of the magnet 
- its ability to align consistently north-south - was first 
recognized. Early in the Han dynasty, between 300 and 
200 BC, the Chinese fashioned a rudimentary compass 
out of lodestone. It consisted of a spoon-shaped object, 
whose bowl balanced and could rotate on a flat polished 
surface. This compass may have been used in the search 
for gems and in the selection of sites for houses. Before 
1000 AD the Chinese had developed suspended and 
pivoted-needle compasses. Their directive power led to the 
use of compasses for navigation long before the origin of 
the aligning forces was understood. As late as the twelfth 
century, it was supposed in Europe that the alignment of 
the compass arose from its attempt to follow the pole star. 
It was later shown that the compass alignment was pro¬ 
duced by a property of the Earth itself. Subsequently, the 
characteristics of terrestrial magnetism played an impor¬ 
tant role in advancing the understanding of magnetism. 

5.1.2 Pioneering studies in terrestrial magnetism 

In 1269 the medieval scholar Pierre Pelerin de Maricourt, 
who took the Latin nom-de-plume of Petrus Peregrinus, 
wrote the earliest known treatise of experimental physics 
(Epistola de Magnete ). In it he described simple laws of 
magnetic attraction. He experimented with a spherical 
magnet made of lodestone, placing it on a flat slab of iron 
and tracing the lines of direction which it assumed. These 
lines circled the lodestone sphere like geographical merid¬ 
ians and converged at two antipodal points, which 
Peregrinus called the poles of the magnet, by analogy to 
the geographical poles. He called his magnetic sphere a 
terrella, for “little Earth.” 

It was known to the Chinese around 500 AD, in the 
Tang dynasty, that magnetic compasses did not point 
exactly to geographical north, as defined by the stars. The 
local deviation of the magnetic meridian from the geo¬ 
graphical meridian is called the magnetic declination. By 
the fourteenth century, the ships of the British navy were 
equipped with a mariner’s compass, which became an 
essential tool for navigation. It was used in conjunction 
with celestial methods, and gradually it became apparent 
that the declination changed with position on the globe. 
During the fifteenth and sixteenth centuries the world¬ 
wide pattern of declination was established. By the 
end of the sixteenth century, Mercator recognized that 
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declination was the principal cause of error in contempo¬ 
rary map-making. 

Georg Hartmann, a German cleric, discovered in 1544 
that a magnetized needle assumed a non-horizontal atti¬ 
tude in the vertical plane. The deviation from the horizon¬ 
tal is now called the magnetic inclination. He reported his 
discovery in a letter to his superior, Duke Albrecht of 
Prussia, who evidently was not impressed. The letter lay 
unknown to the world in the royal archives until its dis¬ 
covery in 1831. Meanwhile, an English scientist, Robert 
Norman, rediscovered the inclination of the Earth’s mag¬ 
netic field independently in 1576. 

In 1600 William Gilbert (1544-1603), an English scien¬ 
tist and physician to Queen Elizabeth, published De 
Magnete, a landmark treatise in which he summarized all 
that was then known about magnetism, including the 
results of about seventeen years of his own research. His 
studies extended also to the electrostatic effects seen when 
some materials were rubbed, for which he coined the name 
“electricity” from the Greek word for amber. Gilbert was 
the first to distinguish clearly between electrical and mag¬ 
netic phenomena. His magnetic studies followed the work 
of Peregrinus three centuries earlier. Using small magnetic 
needles placed on the surface of a sphere of lodestone to 
study its magnetic field, he recognized the poles, where the 
needles stood on end, and the equator, where they lay par¬ 
allel to the surface. Gilbert achieved the leap of imagina¬ 
tion that was necessary to see the analogy between the 
attraction of the lodestone sphere and the known mag¬ 
netic properties of the Earth. He recognized that the Earth 
itself behaved like a large magnet. This was the first 
unequivocal recognition of a geophysical property, pre¬ 
ceding the laws of gravitation in Newton’s Principia by 
almost a century. Although founded largely on qualitative 
observations, De Magnete was the most important work 
on magnetism until the nineteenth century. 

The discovery that the declination of the geomagnetic 
field changed with time was made by Henry Gellibrand 
(1597-1637), an English mathematician and astronomer, 
in 1634. He noted, on the basis of just three measure¬ 
ments made by William Borough in 1580, Edmund 
Gunter in 1622 and himself in 1634, that the declination 
had decreased by about 7° in this time. From these few 
observations he deduced what is now called the secular 
variation of the field. 

Gradually the variation of the terrestrial magnetic 
field over the surface of the Earth was established. In 
1698-1700 Edmund Halley, the English astronomer and 
mathematician, carried out an important oceanographic 
survey with the prime purpose of studying compass vari¬ 
ations in the Atlantic ocean. In 1702 this resulted in the 
publication of the first declination chart. 

5.1.3 The physical origins of magnetism 

By the end of the eighteenth century many characteristics 
of terrestrial magnetism were known. The qualitative 


properties of magnets (e.g., the concentration of their 
powers at their poles) had been established, but the accu¬ 
mulated observations were unable to provide a more fun¬ 
damental understanding of the phenomena because they 
were not quantitative. A major advance was achieved by 
Charles Augustin de Coulomb (1736-1806), the son of a 
noted French family, who in 1784 invented a torsion 
balance that enabled him to make quantitative measure¬ 
ments of electrostatic and magnetic properties. In 1785 he 
published the results of his intensive studies. He estab¬ 
lished the inverse-square law of attraction and repulsion 
between small electrically charged balls. Using thin, mag¬ 
netized steel needles about 24 inches (61 cm) in length, he 
also established that the attraction or repulsion between 
their poles varied as the inverse square of their separation. 

Alessandro Volta (1745-1827) invented the voltaic cell 
with which electrical currents could be produced. The 
relationship between electrical currents and magnetism 
was detected in 1820 by Hans Christian Oersted 
(1777-1851), a Danish physicist. During experiments 
with a battery of voltaic cells he observed that a magnetic 
needle is deflected at right angles to a conductor carrying 
a current, thus establishing that an electrical current pro¬ 
duces a magnetic force. 

Oersted’s result was met with great enthusiasm and 
was followed at once by other notable discoveries in the 
same year. The law for the direction and strength of the 
magnetic force near a current-carrying wire was soon for¬ 
mulated by the French physicists Jean-Baptiste Biot 
(1774-1862) and Felix Savart (1791-1841). Their compa¬ 
triot Andre Marie Ampere (1775-1836) quickly under¬ 
took a systematic set of experiments. He showed that a 
force existed between two parallel straight current- 
carrying wires, and that it was of a type different from the 
known electrical forces. Ampere experimented with the 
magnetic forces produced by current loops and proposed 
that internal electrical currents were responsible for the 
existence of magnetism in iron objects (i.e., ferromagnet¬ 
ism). This idea of permanent magnetism due to con¬ 
stantly flowing currents was audacious for its time. 

At this stage, the ability of electrical currents to gener¬ 
ate magnetic fields was known, but it fell to the English 
scientist Michael Faraday (1791-1867) to demonstrate 
in 1831 what he called “magneto-electric” induction. 
Faraday came from a humble background and had little 
mathematical training. Yet he was a gifted experimenter, 
and his results demonstrated that the change of magnetic 
flux in a coil (whether produced by introducing a magnet 
or by the change in current in another coil) induced an 
electric current in the coil. The rule that governs the direc¬ 
tion of the induced current was formulated three years 
later by a Russian physicist, Heinrich Lenz (1804-1865). 
Unhampered by mathematical equations, Faraday made 
fundamental contributions to understanding magnetic 
processes. Instead of regarding magnetic and electrical 
phenomena as the effects of centers of force acting at a 
distance, he saw in his mind’s eye fictional lines of force 
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traversing space. This image emphasized the role of the 
medium and led eventually to the concept of magnetic 
field, which Faraday first used in 1845. 

Although much had been established by the early 
1830s, it was still necessary to interpret the strengths of 
magnetic forces by relating magnetic units to mechanical 
units. This was achieved in 1832 by the German scientist 
and mathematician, Carl Friedrich Gauss (1777-1855), 
who assumed that static magnetism was carried by mag¬ 
netic “charges,” analogous to the carriers of static elec¬ 
tricity. Experiment had shown that, in contrast to electric 
charge, magnetic poles always occur as oppositely signed 
pairs, and so the basic unit of magnetic properties corre¬ 
sponds to the dipole. Together with Wilhelm Weber 
(1804-1891), Gauss developed a method of absolute 
determination of the intensity of the Earth's magnetic 
field. They founded a geomagnetic observatory at 
Gottingen where the Earth’s magnetic field was observed 
at regular intervals. By 1837 global charts of the total 
intensity, inclination and declination were in existence, 
although the data had been measured at different times 
and their areal coverage was incomplete. To analyze the 
data-set Gauss applied the mathematical techniques of 
spherical harmonic analysis and the separation of vari¬ 
ables, which he had invented. In 1839 he established that 
the main part of the Earth's magnetic field was a dipole 
field that originated inside the Earth. 

The fundamental physical laws governing magnetic 
effects were now firmly established. In 1872 James Clerk 
Maxwell (1831-1879), a Scottish physicist, derived a set 
of equations that quantified all known relationships 
between electrical and magnetic phenomena: Coulomb’s 
laws of force between electric charges and magnetic poles; 
Oersted’s and Ampere's laws governing the magnetic 
effects of electric currents; Faraday’s and Lenz’s laws of 
electromagnetic induction; and Ohm's law relating 
current to electromotive force. Maxwell’s mathematical 
studies predicted the propagation of electric waves in 
space, and concluded that light is also an electromagnetic 
phenomenon transmitted through a medium called the 
luminiferous ether. The need for this light-transmitting 
medium was eliminated by the theory of relativity. By 
putting the theory of the electromagnetic field on a math¬ 
ematical basis, Maxwell enabled a greater understanding 
of electromagnetic phenomena before the discovery of 
the electron. 

A further notable discovery was made in 1879 by 
Heinrich Lorentz (1853-1928), a Dutch physicist. In 
experiments with vacuum tubes he observed the deflec¬ 
tion of a beam of moving electrical charge by a magnetic 
field. The deflecting force acted in a direction perpendicu¬ 
lar to the magnetic field and to the velocity of the charged 
particles, and was proportional to both the field and the 
velocity. This result now serves to define the unit of mag¬ 
netic induction. 

Since the time of man's first awareness of magnetic 
behavior, students of terrestrial magnetism have made 


important contributions to the understanding of magnet¬ 
ism as a physical phenomenon. In turn, advances in the 
physics of magnetism have helped geophysicists to under¬ 
stand the morphology and origin of the Earth’s magnetic 
field, and to apply this knowledge to geological processes, 
such as global tectonics. The physical basis of magnetism 
is fundamental to the geophysical topics of geomagnet¬ 
ism, rock magnetism and paleomagnetism. 

5.2 THE PHYSICS OF MAGNETISM 

5.2.1 Introduction 

Early investigators conceptualized gravitational, electri¬ 
cal and magnetic forces between objects as instantaneous 
effects that took place through direct action-at-a- 
distance. Faraday introduced the concept of the field of a 
force as a property of the space in which the force acts. 
The force-field plays an intermediary role in the interac¬ 
tion between objects. For example, an electric charge is 
surrounded by an electrical field that acts to produce a 
force on a second charge. The pattern of a field is por¬ 
trayed by field lines. At any point in a field the direction of 
the force is tangential to the field line and the intensity of 
the force is proportional to the number of field lines per 
unit cross-sectional area. 

Problems in magnetism are often more complicated 
for the student than those in gravitation and electrostat¬ 
ics. For one thing, gravitational and electrostatic fields act 
centrally to the source of force, which varies in each case 
as the inverse square of distance. Magnetic fields are not 
central; their directions vary with azimuth. Moreover, 
even in the simplest case (that of a magnetic dipole or a 
small current loop) the field strength falls off inversely as 
the cube of distance. To make matters more complicated, 
the student has to take account of two magnetic fields 
(denoted by B and H). 

The confusion about the B-field and the H-field may 
be removed by recalling that all magnetic fields originate 
with electrical currents. This is the case even for perma¬ 
nent magnets, as Ampere astutely recognized in 1820. We 
now know that these currents are associated with the 
motions of electrons about atomic nuclei in the perma¬ 
nent magnets. The fundamental magnetic field associated 
with currents in any medium is B. The quantity H should 
be regarded as a computational parameter proportional 
to B in non-magnetizable materials. Inside a magnetizable 
material, H describes how B is modified by the magnetic 
polarization (or magnetization, M) of the material. The 
magnetic B-field is also called the magnetic induction or 
magnetic flux density. 

Historically, the laws of magnetism were established 
by relating the B-field to fictitious centers of magnetic 
force called magnetic poles, defined by comparison with 
the properties of a bar magnet. Gauss showed that, in 
contrast to electrostatic charges, free magnetic poles 
cannot exist; each positive pole must be paired with a 
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Fig. 5.1 The characteristic 
field lines of a magnetic 
dipole are found around (a) a 
short bar magnet, (b) a small 
loop carrying an electric 
current, and (c) a uniformly 
magnetized sphere. 





corresponding negative pole. The most important type of 
magnetic field - and also the dominant component of the 
geomagnetic field - is that of a magnetic dipole (Fig. 5.1a). 
This is the field of two magnetic poles of opposite sense 
that are infinitesimally close to each other. The geometry 
of the field lines shows the paths along which a free mag¬ 
netic pole would move in the vicinity of the dipole. A tiny 
current loop (Fig. 5.1b) and a uniformly magnetized 
sphere (Fig. 5.1c) also have dipole-type magnetic fields 
around them. Although magnetic poles do not exist phys¬ 
ically, many problems that arise in geophysical situations 
can be readily solved in terms of surface distributions of 
poles or dipoles. So we will first examine these concepts. 


5.2.2 Coulomb's law for magnetic poles 

Coulomb’s experiments in 1785 established that the force 
between the ends of long thin magnets was inversely pro¬ 
portional to the square of their separation. Gauss 
expanded Coulomb’s observations and attributed the 
forces of attraction and repulsion to fictitious magnetic 
charges, or poles. An inverse square law for the force F 
between magnetic poles with strengths p 1 and p 2 at dis¬ 
tance r from each other can be formulated as 


F(r) = K 


P 1 P 2 


(5.1) 


The proportionality constant K was originally defined to 
be dimensionless and equal to unity, analogously to the 
law of electrostatic force. This gave the dimensions of 
pole strength in the centimeter-gram-second (c.g.s.) 
system as dyne 1/2 cm. 


5.2.2.1 The field of a magnetic pole 

The gravitational field of a given mass is defined as the 
force it exerts on a unit mass (Section 2.2.2). Similarly, the 
electric field of a given charge is the force it exerts on a 
unit charge. These ideas cannot be transferred directly to 
magnetism, because magnetic poles do not really exist. 
Nevertheless, many magnetic properties can be described 
and magnetic problems solved in terms of fictitious poles. 


For example, we can define a magnetic field B as the force 
exerted by a pole of strength p on a unit pole at distance r. 
From Eq. (5.1) we get 

B{r) = K^ (5.2) 

Setting K= 1, the unit of the magnetic B-field has 
dimensions dyne 1/2 cm -1 in c.g.s. units and is called a 
gauss. Geophysicists employ a smaller unit, the gamma 
(y), to describe the geomagnetic field and to chart mag¬ 
netic anomalies (1 -y = 10 -5 gauss). 

Unfortunately, the c.g.s. system required units of elec¬ 
trical charge that had different dimensions and size in 
electrostatic and electromagnetic situations. By interna¬ 
tional agreement the units were harmonized and rational¬ 
ized. In the modern Systeme Internationale (SI) units the 
proportionality constant K is not dimensionless. It has 
the value /t () /4it, where p 0 is called the permeability con¬ 
stant and is equal to 4 ttX 10~ 7 N A -2 (or henry/meter, 
H m -1 , which is equivalent to N A -2 ). 

52.2.2 The potential of a magnetic pole 

In studying gravitation we also used the concept of a field 
to describe the region around a mass in which its attraction 
could be felt by another test mass. In order to move the test 
mass away from the attracting mass, work had to be done 
against the attractive force and this was found to be equal 
to the gain of potential energy of the test mass. When the 
test mass was a unit mass, the attractive force was called 
the gravitational field and the gain in potential energy was 
called the change in potential. We calculated the gravita¬ 
tional potential at distance r from an attracting point mass 
by computing the work that would have to be expended 
against the field to move the unit mass from r to infinity. 

We can define the magnetic potential IFat a distance r 
from a pole of strength p in exactly the same way. The 
magnetic field of the pole is given by Eq. (5.2). Using the 
value ju 0 /4 tt for K and expressing the pole strength p in SI 
units, the magnetic potential at r is given by 

W=-]Bdr = ££ (5.3) 
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Fig. 5.2 Geometry for the calculation of the potential of a pair of 
magnetic poles. 


5.2.3 The magnetic dipole 

In Fig. 5.1 the line joining the positive and negative poles 
(or the normal to the plane of the loop, or the direction of 
magnetization of the sphere) defines an axis, about which 
the field has rotational symmetry. Let two equal and 
opposite poles, + p and - p, be located a distance d apart 
(Fig. 5.2). The potential IF at a distance r from the mid¬ 
point of the pair of poles, in a direction that makes an 
angle 9 to the axis, is the sum of the potentials of the pos¬ 
itive and negative poles. At the point (r, 9) the distances 
from the respective poles are r + and r and we get for the 
magnetic potential of the pair 


W— 


_Fo P( 1 


4IT 



(5.4) 




(5.5) 


The pair of opposite poles is considered to form a 
dipole when their separation becomes infinitesimally small 
compared to the distance to the point of observation (i.e., 
d<r). In this case, we get the approximate relations 

r + ~ r — ^cos 6 

(5- 6 ) 

r_ ~ r + ^cosd' 

When d « r, we can write 0®= 9' and terms of order (dir) 2 
and higher can be neglected. This leads to the further sim¬ 
plifications 

r_ - r + ~ (cos 9' +cos0) ~ dcos9 



torque = Fd sin 9 
X =pd B sin 9 = m x B 

Fig. 5.3 Definition of the magnetic moment m of a pair of magnetic 
poles. 

Substituting Eq. (5.7) in Eq. (5.5) gives the dipole poten¬ 
tial at the point (r, 9): 

TJ , Fo (dp)cos9 Mo mcos9 , c 0l 

^~4tt r 2 = 4tt r 2 ( 5 ' 8) 

The quantity = (dp) is called the magnetic moment 
of the dipole. This definition derives from observations 
on bar magnets. The torque exerted by a magnetic field 
to turn the magnet parallel to the field direction is pro¬ 
portional to m. This applies even when the separation 
of the poles becomes very small, as in the case of the 
dipole. 

The torque can be calculated by considering the forces 
exerted by a uniform magnetic field B on a pair of mag¬ 
netic poles of strength p separated by a distance d (Fig. 
5.3). A force equal to (Bp) acts on the positive pole and an 
equal and opposite force acts on the negative pole. If the 
magnetic axis is oriented at angle 9 to the field, the per¬ 
pendicular distance between the lines of action of the 
forces is d sin 9. The torque t felt by the magnet is equal 
to B(pd) sin 9 (i.e., t = mB sin 9). Taking into account the 
direction of the torque and using the conventional nota¬ 
tion for the cross product of two vectors this gives for the 
magnetic torque 

t = m X B (5.9) 

5.2.4 The magnetic field of an electrical current 

The equation used to define the magnetic B-field was 
formulated by Lorentz in 1879. Let q be an electrical 
charge that moves with velocity v through a magnetic 
field B (Fig. 5.4a). The charged particle experiences a 
deflecting force F given by Lorentz’s law, which in SI 
units is: 


r + r_ ~ r 2 — p^-cos 2 9 ~ r 2 


(5.7) 


¥ = q(\ X B) 


(5.10) 
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F = q (v x B ) 



Fig. 5.4 Illustrations of (a) Lorentz's law for the deflecting force F 
experienced by an electrical charge that moves with velocity v through a 
magnetic field B, and (b) the law of Biot and Savart for the force 
experienced by an element dl of a conductor carrying a current / in a 
magnetic field B. 

The SI unit of the magnetic B-field defined by this equa¬ 
tion is called a tesla', it has the dimensions N A -1 m -1 . 

Imagine the moving charge to be confined to move 
along a conductor of length dl and cross-section A (Fig. 
5.4b). Let the number of charges per unit volume be N. 
The number inside the element dl is then NAdl. Each 
charge experiences a deflecting force given by Eq. (5.10). 
Thus the total force transferred to the element dl is 

dF = NAdlq{\ X B) = NAvq(d\ X B) (5.11) 

The electrical current / along the conductor is the total 
charge that crosses A per second, and is given by 1= NAvq. 
From Eq. (5.11) we get the law of Biot and Savart for the 
force experienced by the element dl of a conductor carry¬ 
ing a current /in a magnetic field B: 

dF = /(dl X B) (5.12) 

The orienting effect of an electrical current on mag¬ 
netic compass needles, reported by Oersted and Ampere 
in 1820, is illustrated in Fig. 5.5. The magnetic field lines 
around an infinitely long straight wire form concentric 
circles in the plane normal to the wire. The strength of the 
B-field around the wire is 


= /V 

2-nr 


(5.13) 


The Biot-Savart law can be applied to determine the 
torque exerted on a small rectangular loop PQRS in a 



Fig. 5.5 Small compass needles show that the magnetic field lines 
around an infinitely long straight wire carrying an electrical current form 
concentric circles in a plane normal to the wire. 




Fig. 5.6 (a) Rectangular loop carrying a current / in a uniform magnetic 
field 6; (b) derivation of the torque t experienced by the loop. 

magnetic field (Fig. 5.6a). Let the lengths of the sides of 
the loop be a and b, respectively, and define the .v-axis 
parallel to the sides of length a. The area of the loop can 
be expressed as a vector with magnitude A = ab, and 
direction n normal to the plane of the loop. Suppose that 
a current / flows in the loop and that a magnetic field B 
acts normal to the .v-axis, making an angle 9 with the 
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normal to the plane of the loop. Applying Eq. (5.12), a 
force F x equal to (IbB cos 0) acts on the side PQ in the 
direction of +x; its effect is cancelled by an equal and 
opposite force F x acting on side RS in the direction of -x. 
Forces equal to ( IaB ) act in opposite directions on the 
sides QR and SP (Fig. 5.6b). The perpendicular distance 
between their lines of action is b sin 9, so the torque t 
experienced by the current loop is 

t = (IaB)bsind = (IA)B sin0 

t = m X B 

The quantity m = I A is a vector with direction parallel to 
the normal to the plane of the current loop. This expression 
is valid for an arbitrary small loop of area A, regardless of 
its shape. By comparing Eqs. (5.14) and (5.9) for the torque 
on a dipole, it is evident that m corresponds to the magnetic 
moment of the current loop. At distances greater than the 
dimensions of the loop, the magnetic field is that of a 
dipole at the center of the loop (Fig. 5.1b). The definition 
of m in terms of a current-carrying loop shows that mag¬ 
netic moment has the units of current times area (A m 2 ). 

5.2.5 Magnetization and the magnetic field inside a 
material 

A true picture of magnetic behavior requires a quantum- 
mechanical analysis. Fortunately, a working understand¬ 
ing of the magnetic behavior of materials can be acquired 
without getting involved in the quantum-mechanical 
details. The simplified concept of atomic structure intro¬ 
duced by Ernest Rutherford in 1911 gives a readily under¬ 
standable model for the magnetic behavior of materials. 
The motion of an electron around an atomic nucleus is 
treated like the orbital motion of a planet about the Sun. 
The orbiting charge forms an electrical current with 
which an orbital magnetic moment is associated. A planet 
also rotates about its axis; likewise each electron can be 
visualized as having a spin motion about an axis. The 
spinning electrical charge produces a spin magnetic 
moment. Each magnetic moment is directly related to the 
corresponding angular momentum. In quantum theory 
each type of angular momentum of an electron is quan¬ 
tized. Thus the spin and orbital magnetic moments are 
restricted to having discrete values. The spin magnetic 
moment is usually more important than the orbital 
moment in the rock-forming minerals (see Section 5.2.6). 

A simplified picture of the magnetic moments inside a 
material is shown in Fig. 5.7. The magnetic moment m of 
each atom is associated with a current loop as illustrated 
in Fig. 5.1b and described in the previous section. The net 
magnetic moment of a volume V of the material depends 
on the degree of alignment of the individual atomic mag¬ 
netic moments. It is the vector sum of all the atomic mag¬ 
netic moments in the material. The magnetic moment per 
unit volume of the material is called its magnetization , 
denoted M: 



Fig. 5.7 Schematic representation of the magnetic moments inside a 
material; each magnetic moment m is associated with a current loop on 
an atomic scale. 

M = 2 m , /K (5.15) 

Magnetization has the dimensions of magnetic 
moment (A m 2 ) divided by volume (m 3 ), so that the SI 
units of M are A m _1 . The dimensions of B are 
N A -1 m _1 and those of p. a are N A -2 ; consequently the 
dimensions of Blp. Q are also A m -1 . In general, the mag¬ 
netization M inside a magnetic material will not be 
exactly equal to Bl/x (j ; let the difference be H, so that 

H = B/p, 0 — M (5.16) 

In the earlier c.g.s. system H was defined by the vector 
equation H = B - 4 ttM, and the dimensions of H and B 
were the same. For this reason H became known as the 
magnetizing field (or H-field). It is a readily computed 
quantity that is useful in determining the value of the true 
magnetic field B in a medium. The fundamental differ¬ 
ence between the B-field and the H-field can be under¬ 
stood by inspection of the configurations of their 
respective field lines. The field lines of B always form 
closed loops (Fig. 5.1). The field lines of H are discontinu¬ 
ous at surfaces where the magnetization M changes in 
strength or direction. Magnetic methods of geophysical 
exploration take advantage of surface effects that arise 
where the magnetization is interrupted. 

Anomalous magnetic fields arise over geological struc¬ 
tures that cause a magnetization contrast between adjacent 
rock types. Many magnetic anomalies can be analyzed by 
replacing the change in magnetization at a surface by an 
appropriate surface distribution of fictitious magnetic 
poles. The methodology, though based on a fundamentally 
false concept, is quite practical for modelling anomaly 
shapes and is often much simpler than a physically correct 
analysis in terms of current distributions. For example, in a 
uniformly magnetized rod, the N-poles of the elementary 
magnetic moments are considered to be exposed on one 
end of the rod, with a corresponding distribution of S- 
poles on the opposite end; inside the material the N-poles 
and S-poles cancel each other (Fig. 5.8a). The H-field 
inside the material arises from these pole distributions and 
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Fig. 5.8 The magnetization 
of a material may be 
envisaged as due to an 
alignment of (a) small dipoles 
or (b) equivalent current 
loops; even in a permanent 
magnet the physical source of 
the B-field of the material is a 
system of electrical currents 
on an atomic scale. 
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acts in the opposite direction to the magnetization M. 
Outside the magnet the B-field and H-field are parallel; the 
H-field is discontinuous at the ends of the magnet. 

The same situation can be portrayed in terms of 
current loops. The physical source of every B-field is an 
electrical current, even in a permanent magnet (Fig. 
5.8b). Atomic current loops give a continuous B-field 
that emerges from the magnet at one end, re-enters at the 
other end and is closed inside the magnet. The aligned 
magnetic moments of the elementary current loops 
cancel out inside the body of the magnet, but the cur¬ 
rents in the loops adjacent to the sides of the magnet 
combine to form a surface “current" that maintains the 
magnetization M. 

In a vacuum there is no magnetization (M = 0); the 
vectors B and H are parallel and proportional 
(B = /u, 0 H). Inside a magnetizable material the magnetic 
B-field has two sources. One is the external system of 
real currents that produce the magnetizing field H; the 
other is the set of internal atomic currents that cause the 
atomic magnetic moments whose net alignment is 
expressed as the magnetization M. In a general, 
anisotropic magnetic material B, M and H are not par¬ 
allel. However, many magnetic materials are not 
strongly anisotropic and the elementary atomic mag¬ 
netic moments align in a statistical fashion with the mag¬ 
netizing field. In this case M and H are parallel and 
proportional to each other 

M = AH (5.17) 

The proportionality factor k is a physical property of 
the material, called the magnetic susceptibility. It is a 
measure of the ease with which the material can be mag¬ 
netized. Because M and H have the same units (A m _1 ), k 
is a dimensionless quantity. The susceptibility of most 


materials is temperature dependent, and in some materi¬ 
als (ferromagnets and ferrites) k depends on H in a com¬ 
plicated fashion. In general, Eq. (5.16) can be rewritten 

B = /x 0 (H + M)=/x 0 H(l+A') 

(5.18) 

B = MMo 11 

The quantity n = (1 + k) is called the magnetic perme¬ 
ability of the material. The term “permeability” recalls 
the early nineteenth century association of magnetic 
powers with an invisible fluid. For example, the perme¬ 
ability of a material expresses the ability of the material 
to allow a fluid to pass through it. Likewise, the magnetic 
permeability is a measure of the ability of a material to 
convey a magnetic flux. Ferromagnetic metals have high 
permeabilities; in contrast, minerals and rocks have low 
susceptibilities and permeabilities p. « 1. 

5.2.6 The magnetic properties of materials 

The magnetic behavior of a solid depends on the mag¬ 
netic moments of the atoms or ions it contains. As dis¬ 
cussed above, atomic and ionic magnetic moments are 
proportional to the quantized angular momenta associ¬ 
ated with the orbital motion of electrons about the 
nucleus and with the spins of the electrons about their 
own axes of rotation. In quantum theory the exclusion 
principle of Wolfgang Pauli states that no two electrons 
in a given system can have the same set of quantum 
numbers. When applied to an atom or ion, Pauli's princi¬ 
ple stipulates that each possible electron orbit can be 
occupied by up to two electrons with opposite spins. The 
orbits are arranged in shells around the nucleus. The 
magnetic moments of paired opposite spins cancel each 
other out. Consequently, the net angular momentum 
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and the net magnetic moment of a filled shell must be 
zero. The net magnetic moment of an atom or ion arises 
from incompletely filled shells that contain unpaired 
spins. The atoms or ions in a solid are not randomly dis¬ 
tributed but occupy fixed positions in a regular lattice, 
which reflects the symmetry of the crystalline structure 
and which controls interactions between the ions. Hence, 
the different types of magnetic behavior observed in 
solids depend not only on the presence of ions with 
unpaired spins, but also on the lattice symmetry and cell 
size. 

Three main classes of magnetic behavior can be distin¬ 
guished on the basis of magnetic susceptibility: diamagnet¬ 
ism, paramagnetism and ferromagnetism. In diamagnetic 
materials the susceptibility is low and negative, i.e., a mag¬ 
netization develops in the opposite direction to the applied 
field. Paramagnetic materials have low, positive suscepti¬ 
bilities. Ferromagnetic materials can be subdivided into 
three categories. True ferromagnetism is a cooperative 
phenomenon observed in metals like iron, nickel and 
cobalt, in which the lattice geometry and spacing allows 
the exchange of electrons between neighboring atoms. 
This gives rise to a molecular field by means of which the 
magnetic moments of adjacent atoms reinforce their 
mutual alignment parallel to a common direction. Ferro¬ 
magnetic behavior is characterized by high positive sus¬ 
ceptibilities and strong magnetic properties. The crystal 
structures of certain minerals permit an indirect coopera¬ 
tive interaction between atomic magnetic moments. This 
indirect exchange confers magnetic properties that are 
similar to ferromagnetism. The mineral may display anti¬ 
ferromagnetism or ferrimagnetism. The small group of fer- 
rimagnetic minerals is geophysically important, especially 
in connection with the analysis of the Earth’s paleo- 
magnetic field. 

5.2.6.1 Diamagnetism 

All magnetic materials show a diamagnetic reaction in a 
magnetic field. The diamagnetism is often masked by 
stronger paramagnetic or ferromagnetic properties. It is 
characteristically observable in materials in which all elec¬ 
tron spins are paired. 

The Lorentz law (Eq. (5.10)) shows that a change in 
the B-field alters the force experienced by an orbiting 
electron. The plane of the electron orbit is compelled to 
precess around the field direction; the phenomenon is 
called Larmor precession. It represents an additional 
component of rotation and angular momentum. The 
sense of the rotation is opposite to that of the orbital 
rotation about the nucleus. Hence, the magnetic moment 
associated with the Larmor precession opposes the 
applied field. As a result a weak magnetization propor¬ 
tional to the field strength is induced in the opposite 
direction to the field. The magnetization vanishes when 
the applied magnetic field is removed. Diamagnetic sus¬ 
ceptibility is reversible, weak and negative (Fig. 5.9a); it 





Fig. 5.9 (a) Variations of magnetization M with applied magnetic field 
H in paramagnetic and diamagnetic materials; (b) the variation of 
paramagnetic susceptibility with temperature, and (c) the linear plot of 
the inverse of paramagnetic susceptibility against temperature. 

is independent of temperature. Many important rock¬ 
forming minerals belong to this class, amongst them 
quartz and calcite. They have susceptibilities around 
— 10 _6 in SI units. 

52.6.2 Paramagnetism 

Paramagnetism is a statistical phenomenon. When one or 
more electron spins is unpaired, the net magnetic moment 
of an atom or ion is no longer zero. The resultant mag¬ 
netic moment can align with a magnetic field. The align¬ 
ment is opposed by thermal energy which favors chaotic 
orientations of the spin magnetic moments. The magnetic 
energy is small compared to the thermal energy, and in 
the absence of a magnetic field the magnetic moments are 
oriented randomly. When a magnetic field is applied, the 
chaotic alignment of magnetic moments is biassed 
towards the field direction. A magnetization is induced 
proportional to the strength of the applied field and par¬ 
allel to its direction. The susceptibility is reversible, small 
and positive (Fig. 5.9a). An important paramagnetic 
characteristic is that the susceptibility k varies inversely 
with temperature (Fig. 5.9b) as given by the Curie law 

k = £ (5.19) 

where the constant C is characteristic of the material. 
Thus, a plot of Ilk against temperature is a straight line 
(Fig. 5.9c). In solids and liquids mutual interactions 
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Fig. 5.10 Schematic 
representations of the 
alignments of atomic 
magnetic moments in (a) 
ferromagnetism, (b) 
antiferromagnetism, (c) spin- 
canted antiferromagnetism, 
and (d) ferrimagnetism. 
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between ions may be quite strong and paramagnetic behav¬ 
ior is only displayed when the thermal energy exceeds a 
threshold value. The temperature above which a solid is 
paramagnetic is called the paramagnetic Curie temperature 
or Weiss constant of the material, denoted by 6; it is close 
to zero kelvin in paramagnetic solids. At temperatures 
T>6 the paramagnetic susceptibility k is given by the 
Curie- Weiss law 

(5.20) 

For a solid the plot of 1 Ik against ( T- 9) is a straight line 
(Fig. 5.9c). Many clay minerals and other rock-forming 
minerals (e.g., chlorite, amphibole, pyroxene, olivine) are 
paramagnetic at room temperature, with susceptibilities 
commonly around 10 -5 —10 -4 in SI units. 

5.2.63 Ferromagnetism 

In paramagnetic and diamagnetic materials the interac¬ 
tions between individual atomic magnetic moments are 
small and often negligible. However, in some metals (e.g., 
iron, nickel, cobalt) the atoms occupy lattice positions 
that are close enough to allow the exchange of electrons 
between neighboring atoms. The exchange is a quantum- 
mechanical effect that involves a large amount of energy, 
called the exchange energy of the metal. The exchange 
interaction produces a very strong molecular field within 
the metal, which aligns the atomic magnetic moments 
(Fig. 5.10a) exactly parallel and produces a spontaneous 
magnetization (M s ). The magnetic moments react in 
unison to a magnetic field, giving rise to a class of strong 
magnetic behavior known as ferromagnetism. 

A rock sample may contain thousands of tiny ferro¬ 
magnetic mineral grains. The magnetization loop of a 
rock sample shows the effects of magnetic hysteresis (Fig. 
5.11). In strong fields the magnetization reaches a satura¬ 
tion value (equal to M s ), at which the individual magnetic 
moments are aligned with the applied field. If the magne¬ 
tizing field is reduced to zero, a ferromagnetic material 
retains part of the induced magnetization. The residual 


M 



Fig. 5.11 The magnetization loop of an arbitrary ferromagnetic 
material. 

magnetization is called the remanence, or isothermal rema¬ 
nent magnetization (IRM); if the sample is magnetized to 
saturation, the remanence is a saturation IRM (M rs ). For a 
given ferromagnetic mineral, the ratio MJM s depends on 
grain size. If a magnetic field is applied in the opposite 
direction to the IRM, it remagnetizes part of the material 
in the antiparallel direction. For a particular value H c of 
the reverse field (called the coercive force) the induced 
reverse magnetization exactly cancels the original rema¬ 
nence and the net magnetization is zero. If the reverse field 
is removed at this stage, the residual remanence is smaller 
than the original IRM. By repeating the process in ever 
stronger reverse fields a back-field H cv (called the coerciv- 
ity of remanence) is found which gives a reverse remanence 
that exactly cancels the IRM, so that the residual rema¬ 
nence is zero. The ratio HJH c also depends on grain size. 
Rock-forming magnetic minerals often have natural rema- 
nences with very high coercive properties. 
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When a ferromagnetic material is heated, its sponta¬ 
neous magnetization disappears at the ferromagnetic 
Curie temperature (T c ). At temperatures higher than the 
paramagnetic Curie temperature ( 6 ) the susceptibility k 
becomes the paramagnetic susceptibility, so that 1 Ik is 
proportional to (T - 0) as given by the Curie-Weiss law 
(Eq. (5.20)). The paramagnetic Curie temperature for a 
ferromagnetic solid is several degrees higher than the fer¬ 
romagnetic Curie temperature, T c . The gradual transition 
from ferromagnetic to paramagnetic behavior is explained 
by persistence of the molecular field due to short-range 
magnetic order above T c . 

5.2.64 Antiferromagnetism 

In oxide crystals the oxygen ions usually keep the metal 
ions far apart, so that direct exchange of electrons 
between the metal ions is not possible. However, in 
certain minerals, interaction between magnetic spins 
becomes possible by the exchange of electrons from one 
metal ion to another through the electron “cloud” of the 
oxygen ion. This indirect exchange (or superexchange ) 
process results in antiparallel directions of adjacent 
atomic magnetic moments (Fig. 5.10b), giving two sub¬ 
lattices with equal and opposite intrinsic magnetic 
moments. As a result, the susceptibility of an antiferro¬ 
magnetic crystal is weak and positive, and remanent 
magnetization is not possible. The antiferromagnetic 
alignment breaks down at the Neel temperature, above 
which paramagnetic behavior is shown. The Neel tem¬ 
perature T n of many antiferromagnetic substances is 
lower than room temperature, at which they are para¬ 
magnetic. A common example of an antiferromagnetic 
mineral is ilmenite (FeTiO,), which has a Neel tempera¬ 
ture of 50 K. 

52.6.5 Parasitic ferromagnetism 

When an antiferromagnetic crystal contains defects, 
vacancies or impurities, some of the antiparallel spins are 
unpaired. A weak “defect moment” can result due to 
these lattice imperfections. Also if the spins are not 
exactly antiparallel but are inclined at a small angle, they 
do not cancel out completely and again a ferromagnetic 
type of magnetization can result (Fig. 5.10c). Materials 
that exhibit this form of parasitic ferromagnetism have the 
typical characteristics of a true ferromagnetic metal, 
including hysteresis, a spontaneous magnetization and a 
Curie temperature. An important geological example is 
the common iron mineral hematite (a-Fe 2 0 3 ), in which 
both the spin-canted and defect moments contribute to 
the ferromagnetic properties. Hematite has a variable, 
weak spontaneous magnetization of about 2000 Am -1 , 
very high coercivity and a Curie temperature around 
675 °C. The variable magnetic properties are due to varia¬ 
tion in the relative importances of the defect and spin- 
canted moments. 


52.6.6 Ferrimagnetism 

The metallic ions in an antiferromagnet occupy the voids 
between the oxygen ions. In certain crystal structures, of 
which the most important geological example is the spinel 
structure, the sites of the metal ions differ from each other 
in the coordination of the surrounding oxygen ions. 
Tetrahedral sites have four oxygen ions as nearest neigh¬ 
bors and octahedral sites have six. The tetrahedal and 
octahedral sites form two sublattices. In a normal spinel 
the tetrahedral sites are occupied by divalent ions and the 
octahedral sites by Fe 3+ ions. The most common iron 
oxide minerals have an inverse spinel structure. Each sub¬ 
lattice has an equal number of Fe 3+ ions. The same 
number of divalent ions (e.g. Fe 2+ ) occupy other octahe¬ 
dral sites, while the corresponding number of tetrahedral 
sites is empty. 

When the indirect exchange process involves antipar¬ 
allel and unequal magnetizations of the sublattices (Fig. 
5.lOd), resulting in a net spontaneous magnetization, the 
phenomenon is called ferrimagnetism. Ferrimagnetic 
materials (called ferrites ) exhibit magnetic hysteresis and 
retain a remanent magnetization when they are removed 
from a magnetizing field. Above a given temperature - 
sometimes called the ferrimagnetic Neel temperature but 
more commonly the Curie temperature - the long-range 
molecular order breaks down and the mineral behaves 
paramagnetically. The most important ferrimagnetic 
mineral is magnetite (Fe 3 0 4 ), but maghemite, pyrrhotite 
and goethite are also significant contributors to the mag¬ 
netic properties of rocks. 

5.2.7 Magnetic anisotropy 

Anisotropy is the directional dependency of a property. 
The magnetism of metals and crystals is determined by 
the strengths of the magnetic moments associated with 
atoms or ions, and the distances between neighbors. Here 
the symmetry of the lattice plays an important role, and 
so the magnetic properties of most ferromagnetic materi¬ 
als depend on direction. Magnetic anisotropy is an 
important factor in the dependence on grain size of the 
magnetic behavior of rocks and minerals. There are three 
important types: magnetocrystalline, magnetostatic and 
magnetostrictive anisotropies. 

5.2.7.1 Magnetocrystalline anisotropy 

The direction of the spontaneous magnetization (M s ) in a 
ferromagnetic metal is not arbitrary. The molecular field 
that produces M s originates in the direct exchange of elec¬ 
tron spins between neighboring atoms in a metal. The 
symmetry of the lattice structure of the metal affects the 
exchange process and gives rise to a magnetocrystalline 
anisotropy energy, which has a minimum value when M s is 
parallel to a favored direction referred to as the easy axis 
(or easy direction) of magnetization. The simplest form of 
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magnetic anisotropy is uniaxial anisotropy, when a metal 
has only a single easy axis. For example, cobalt has a 
hexagonal structure and the easy direction is parallel to 
the c-axis at room temperature. Iron and nickel have cubic 
unit cells; at room temperature, the easy axes in iron are 
the edges of the cube, but the easy axes in nickel are the 
body diagonal directions. 

Magnetocrystalline anisotropy is also exhibited by fer¬ 
rites, including the geologically important ferrimagnetic 
minerals. The exchange process in a ferrite is indirect, but 
energetically preferred easy axes of magnetization arise 
that reflect the symmetry of the crystal structure. This 
gives rise to different forms of the anisotropy in hematite 
and magnetite. 

Hematite has a rhombohedral or hexagonal structure 
and a uniaxial anisotropy with regard to the c-axis of sym¬ 
metry. Oxygen ions form a close-packed hexagonal lattice 
in which two-thirds of the octahedral interstices are occu¬ 
pied by ferric (Fe 3+ ) ions. When the spontaneous magne¬ 
tization makes an angle f with the c-axis of the crystal, 
the uniaxial anisotropy energy density can be written to 
first order 

E a = K u s in 2 0 (5.21) 

K u is called the uniaxial magnetocrystalline anisotropy 
constant. Its value in hematite is around -10 3 J m -3 at 
room temperature. The negative value of K u in Eq. (5.21) 
means that E d decreases as the angle cj) increases, and is 
minimum when sin 2 ^> is maximum, i.e., when cj> is 90°. As 
a result, the spontaneous magnetization lies in the basal 
plane of the hematite crystal at room temperature. 

Because of its inverse spinel structure, magnetite has 
cubic anisotropy. Let the direction of the spontaneous 
magnetization be given by direction cosines a v a 2 and a 3 
relative to the edges of the cubic unit cell (Box 1.5). The 
magnetocrystalline anisotropy energy density is then 
given by 

E d = K x (a\a\ + + a^af) + K 2 a\a\a^ (5.22) 

The anisotropy constants and K 2 of magnetite are 
equal to — 1.36X 10 4 J m~ 3 and — 0.44X 10 4 J m -3 , 
respectively, at room temperature. Because these con¬ 
stants are negative, the anisotropy energy density E d is 
minimum when the spontaneous magnetization is along a 
[111] body diagonal, which is the magnetocrystalline easy 
axis of magnetization at room temperature. 

52.7.2 Magnetostatic (shape) anisotropy 

In strongly magnetic materials the shape of the magne¬ 
tized object causes a magnetostatic anisotropy. In rocks 
this effect is associated with the shapes of the individual 
grains of ferrimagnetic mineral in the rock, and to a lesser 
extent with the shape of the rock sample. The anisotropy 
is magnetostatic in origin and can be conveniently 
explained with the aid of the concept of magnetic poles. 



^_ demagnetizing 

field 

Fig. 5.12 The origin of shape anisotropy: the distributions of 
magnetic poles on surfaces that intersect the magnetization of a 
uniformly magnetized prolate ellipsoid produce internal demagnetizing 
fields; these are weak parallel to the long axis (a) and strong parallel to 
the short axis (b). As a result, the net magnetization is stronger parallel 
to the long axis than parallel to the short axis. 

The spontaneous magnetization of a uniformly mag¬ 
netized material can be pictured as giving rise to a distrib¬ 
ution of poles on the free end surfaces (Fig. 5.12). As 
noted above, a property of the magnetic B-field is that its 
field lines form closed loops, whereas the H-field begins 
and ends on boundary surfaces, at which it is discontinu¬ 
ous. The field lines of the magnetic field H outside a 
magnet are parallel to the B-field and are directed from 
the surface distribution of N-poles to the distribution of 
S-poles. In the absence of an externally applied field the 
H-field inside the magnet is also directed from the distrib¬ 
ution of N-poles on one end to the S-poles on the other 
end. It forms a demagnetizing field (H d ) that opposes the 
magnetization. The strength of the demagnetizing field 
varies directly with the surface density of the magnetic 
pole distribution on the end surfaces of the magnet, and 
inversely with the distance between these surfaces. H d 
thus depends on the shape of the magnet and the intensity 
of magnetization; it can be written 

H d = - AM (5.23) 

N is called the demagnetizing factor. It is a dimensionless 
constant determined by the shape of the magnetic grain. It 
can be computed for a geometrical shape, such as a triaxial 
ellipsoid. The demagnetizing factors N v N 2 and N 3 parallel 
to the symmetry axes of an ellipsoid satisfy the relationship 

Aj T N 2 T N 3 = 1 (5.24) 

The magnetostatic energy of the interaction of the grain 
magnetization with the demagnetizing field is called the 
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demagnetizing energy (E d ). For a grain with uniform mag¬ 
netization M in a direction with demagnetizing factor N, 

E d = ^NM 2 (5.25) 

Consider the shape anisotropy of a small grain shaped like 
a prolate ellipsoid. When the spontaneous magnetization 
M s is along the long axis of the ellipsoid (Fig. 5.12a), the 
opposing pole distributions are further away from each 
other and their surface density is lower than when M s is 
parallel to the short axis (Fig. 5.12b). The demagnetizing 
field and energy are smallest when M s is parallel to the long 
axis, which is the energetically favored direction of magne¬ 
tization. The demagnetizing energy is larger in any other 
direction, giving a shape anisotropy. If TV, is the demagne¬ 
tizing factor for the long axis and N 2 that for the short axis 
(TV, < V,), the difference in energy between the two direc¬ 
tions of magnetization defines a magnetostatic anisotropy 
energy density given by 

u n 

E a = ^{N 2 - N X )M 2 (5.26) 

which is minimum when M s is parallel to the longest 
dimension of the grain. 

Shape-dependent magnetic anisotropy is important in 
minerals that have a high spontaneous magnetization. The 
more elongate the grain is, the higher the shape anisotropy 
will be. It is the predominant form of anisotropy in very 
fine grains of magnetite (and maghemite) if the longest 
axis exceeds the shortest axis by only about 20%. 

5.2.73 Magnetostrictive anisotropy 

The process of magnetizing some materials causes them 
to change shape. Within the crystal lattice the interaction 
energy between atomic magnetic moments depends on 
their separations (called the bond length) and on their 
orientations, i.e., on the direction of magnetization. If an 
applied field changes the orientations of the atomic mag¬ 
netic moments so that the interaction energy is increased, 
the bond lengths adjust to reduce the total energy. This 
produces strains which result in a shape change of the fer¬ 
romagnetic specimen. This phenomenon is called magne¬ 
tostriction. A material which elongates in the direction of 
magnetization exhibits positive magnetostriction, while a 
material that shortens parallel to the magnetization 
shows negative magnetostriction. The maximum differ¬ 
ence in magnetoelastic strain, which occurs between the 
demagnetized state and that of saturation magnetization, 
is called the saturation magnetostriction, denoted A s . 

The inverse effect is also possible. For example, if pres¬ 
sure is applied to one face of a cubic crystal, it will 
shorten elastically along the direction of the applied 
stress and will expand in directions perpendicular to it. 
These strains alter the separations of atomic magnetic 
moments, thereby perturbing the effects that give rise to 
magnetocrystalline anisotropy. Thus the application of 


stress to a magnetic material can change its magnetiza¬ 
tion; the effect is called piezomagnetism. On a submicro- 
scopic scale the stress field that surrounds a vacancy, 
defect or dislocation in the crystal structure can locally 
affect the orientations of ionic magnetic spins. 

Magnetostriction is a further source of anisotropy in 
magnetic minerals. The magnetostrictive (or magneto¬ 
elastic) anisotropy energy density E & depends on the 
amount and direction of the stress a. If the saturation 
magnetization makes an angle 6 to the stress, E d is given 
for a uniaxial magnetic mineral by 

£ a = ! A s o-cos 2 0 (5.27) 

This is the simplest expression for magnetostrictive 
energy. It assumes that the magnetostriction is isotropic, 
i.e., that it has the same value in all directions. This condi¬ 
tion is fulfilled if the magnetocrystalline axes of the ferri- 
magnetic minerals in a rock are randomly distributed. 
The magnetoelastic energy of a cubic mineral is more 
complicated. Instead of a single magnetostriction con¬ 
stant A s , separate constants A 100 and A,,, are required for 
the saturation magnetostriction along the [100] and [111] 
directions, respectively, corresponding to the edge and 
body diagonal directions of the cubic unit cell. 

In magnetite the magnetoelastic energy is more than an 
order of magnitude less than the magnetocrystalline energy 
at room temperature. Consequently, magnetostriction plays 
only a secondary role in determining the direction of mag¬ 
netization of magnetite grains. However, in minerals that 
have high magnetostriction (e.g., titanomagnetites (see 
Section 5.3.2.1) with a compositional factor ,v«0.65) the 
magnetoelastic energy may be significant in determining 
easy directions of magnetization, and the magnetization 
may be sensitive to modification by deformation. 

5.3 ROCK MAGNETISM 

5.3.1 The magnetic properties of rocks 

A rock may be regarded as a heterogeneous assemblage of 
minerals. The matrix minerals are mainly silicates or car¬ 
bonates, which are diamagnetic in character. Interspersed 
in this matrix is a lesser quantity of secondary minerals 
(such as the clay minerals) that have paramagnetic prop¬ 
erties. The bulk of the constituent minerals in a rock con¬ 
tribute to the magnetic susceptibility but are incapable of 
any contribution to the remanent magnetic properties, 
which are due to a dilute dispersion of ferrimagnetic min¬ 
erals (e.g., commonly less than 0.01% in a limestone). The 
variable concentrations of ferrimagnetic and matrix min¬ 
erals result in a wide range of susceptibilities in rocks 
(Fig. 5.13). 

The weak and variable concentration of ferrimagnetic 
minerals plays a key role in determining the magnetic 
properties of the rock that are significant geologically and 
geophysically. The most important factors influencing 
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Fig. 5.13 (a) Median values and ranges of the magnetic susceptibility 
of some common rock types, and (b) the susceptibilities of some 
important minerals. 
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Fig. 5.14 Ternary compositional diagram of the iron-titanium oxide 
solid solution magnetic minerals (after McElhinny, 1973). 

titanohematite series. The minerals of a third series, 
pseudobrookite, are paramagnetic at room temperature. 
They are quite rare and are of minor importance in rock 
magnetism. The compositions of naturally occurring 
forms of titanomagnetite and titanohematite usually plot 
as points on the ternary diagram that are displaced from 
the ideal lines towards the Ti0 2 -Fe 1 0 3 axis, which indi¬ 
cates that they are partly oxidized. 


rock magnetism are the type of ferrimagnetic mineral, its 
grain size, and the manner in which it acquires a remanent 
magnetization. 

5.3.2 The ternary oxide system of magnetic minerals 

The most important magnetic minerals are iron-titanium 
oxides, which are naturally occurring ferrites. The mineral 
structure consists of a close-packed lattice of oxygen ions, 
in which some of the interstitial spaces are occupied by 
regular arrays of ferrous (Fe 2+ ) and ferric (Fe 3+ ) iron ions 
and titanium (Ti 4+ ) ions. The relative proportions of 
these three ions determine the ferrimagnetic properties of 
the mineral. The composition of an iron-titanium oxide 
mineral can be illustrated graphically on the ternary oxide 
diagram (Fig. 5.14), the corners of which represent the 
minerals rutile (Ti0 2 ), wustite (FeO), and hematite 
(Fe 2 0 3 ). The proportions of these three oxides in a 
mineral define a point on the ternary diagram. The verti¬ 
cal distance of the point above the FeO-Fe,0 3 baseline 
reflects the amount of titanium in the lattice. Flematite is 
in a higher state of oxidation than wustite; hence the hori¬ 
zontal position along the Fe0-Fe 2 0 3 axis expresses the 
degree of oxidation. 

The most important magnetic minerals belong to two 
solid-solution series: (a) the titanomagnetite, and (b) the 


5.3.2.1 The titanomagnetite series 

Titanomagnetite is the name of the family of iron oxide 
minerals described by the general formula Fe 3 _ v Ti Y 0 4 
(0 <i< 1). These minerals have an inverse spinel structure 
and exemplify a solid-solution series in which ionic 
replacement of two Fe 3+ ions by one Fe 2+ and one Ti 4+ 
ion can take place. The compositional parameter x 
expresses the relative proportion of titanium in the unit 
cell. The end members of the solid-solution series are mag¬ 
netite (Fe 3 0 4 ), which is a typical strongly magnetic ferrite, 
and ulvospinel (Fe 2 Ti0 4 ), which is antiferromagnetic at 
very low temperature but is paramagnetic at room temper¬ 
ature. An alternative form of the general formula is 
xFe 2 Ti0 4 (l - x)Fe 3 0 4 . Written in this way, it is apparent 
that the compositional parameter x describes the molecu¬ 
lar fraction of ulvospinel. As the amount of titanium (x) 
increases, the cell size increases and the Curie temperature 
6 and spontaneous magnetization M s of the titanomag¬ 
netite decrease (Fig. 5.15). 

Magnetite is one of the most important ferrimagnetic 
minerals. It has a strong spontaneous magnetization 
(M = 4.8X10 5 Am -1 ) and a Curie temperature of 
578 °C. Because of the high value of M s , magnetite grains 
can have a strong shape anisotropy. The magnetic suscep¬ 
tibility is the strongest of any naturally occurring mineral 
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Fig. 5.15 Variations of Curie temperature and unit-cell size with 
composition in titanomagnetite (after Nagata, 1961). 


(k ~ 1—10 SI). For many sedimentary and igneous rocks 
the magnetic susceptibility is proportional to the mag¬ 
netite content. 

Maghemite (y-Fe^) can be produced by low- 
temperature oxidation of magnetite. It is a strongly mag¬ 
netic mineral (Af s = 4.5 x10 s A m -1 ). Experiments on 
maghemite doped with small amounts of foreign ions indi¬ 
cate that it has a Curie temperature of 675 °C. However, it 
is metastable and reverts to hematite (a-Fe 2 0 3 ) when 
heated above 300-350 °C. The low-temperature oxidation 
of titanomagnetite leads to a “titanomaghemite” solid- 
solution series. 

Titanomagnetite is responsible for the magnetic prop¬ 
erties of oceanic basalts. The basaltic layer of the oceanic 
crust is the main origin of the marine magnetic anomalies 
that are of vital importance to modern plate tectonic 
theory. The magnetic properties of the 0.5 km thick 
basaltic layer are due to the presence of very fine grained 
titanomagnetite (or titanomaghemite, depending on the 
degree of ocean-floor weathering). The molecular frac¬ 
tion (x) of Fe 2 Ti0 4 in titanomagnetite in oceanic basalts 
is commonly around 0.6. 

53.2.2 The titanohematite series 

The minerals of the titanohematite solid-solution series 
are also variously referred to as “hemoilmenite,” 
“hematite-ilmenite” or “ilmenohematite.” They have the 
general formula Fe 2v Ti v 0 3 . The unit cell has rhombohe- 
dral symmetry. Ionic substitution is the same as for titano- 
magnetite, and the compositional parameter x has the 
same implications for the titanium content of the unit cell. 
The end members of the solid-solution series are hematite 
(Fe 2 0 3 ) and ilmenite (FeTi0 3 ). The chemical formula can 
be written in the alternative form xFeTiOj • (1 - x)Fe 2 0 3 , 
where x represents the molecular fraction of ilmenite. 
As in the case of titanomagnetite, the cell size increases 
and the Curie point decreases as the titanium content 
increases. The Curie point of hematite is 675 °C, while 


ilmenite is antiferromagnetic at low temperature and 
paramagnetic at room temperature. For titanium contents 
0.5<x<0.95 titanohematite is ferrimagnetic and for 
x<0.5 it exhibits parasitic ferromagnetism. 

The end member hematite (a-Fe 2 0 3 ) is an extremely 
important magnetic mineral. Its magnetic properties 
arise from parasitic ferromagnetism due to the spin- 
canted magnetic moment and the possible defect moment 
of its otherwise antiferromagnetic lattice. Hematite has a 
weak spontaneous magnetization (M = 2.2 X 10 3 A m -1 ) 
and a strong uniaxial magnetocrystalline anisotropy 
(7sT u «10 3 J m -3 ). Hematite is paleomagnetically impor¬ 
tant because of its common occurrence and its high mag¬ 
netic and chemical stability. It often occurs as a 
secondary mineral, formed by oxidation of a precursor 
mineral, such as magnetite, or by precipitation from 
fluids passing through a rock. 

5.3.3 Other ferrimagnetic minerals 

Although the iron-titanium oxides are the dominant 
magnetic minerals, rocks frequently contain other miner¬ 
als with ferromagnetic properties. Although pyrite (FeS 2 ) 
is a very common sulfide mineral, especially in sedimen¬ 
tary rocks, it is paramagnetic and therefore cannot carry a 
remanent magnetization. As a result it does not con¬ 
tribute directly to the paleomagnetic properties of rocks, 
but it may act as a source for the formation of goethite or 
secondary magnetite. 

Pyrrhotite is a common sulfide mineral which can 
form authigenically or during diagenesis in sediments, 
and which can be ferrimagnetic in certain compositional 
ranges. It is non-stoichiometric (i.e., the numbers of 
anions and cations in the unit cell are unequal) and has 
the formula Fej Y S. The parameter x refers to the propor¬ 
tion of vacancies among the cation lattice sites and is 
limited to the range 0 < x < 0.14. Pyrrhotite has a pseudo- 
hexagonal crystal structure and would be antiferromag¬ 
netic but for the presence of the cation vacancies. The 
Neel temperature at which the fundamental antiferro- 
magnetism disappears is around 320 °C. Pyrrhotite with 
the formula Fe 7 S 8 is ferrimagnetic with a Curie tempera¬ 
ture close to the Neel temperature and a strong sponta¬ 
neous magnetization of about 10 s Am -1 at room 
temperature. The magnetocrystalline anisotropy restricts 
the easy axis of magnetization to the hexagonal basal 
plane at room temperature. 

The iron oxyhydroxide goethite (FeOOH) is another 
common authigenic mineral in sediments. Like hematite, 
goethite is antiferromagnetic, but has a weak parasitic fer¬ 
romagnetism. It has a very high coercivity (with maximum 
values in excess of 5 T) and a low Curie point around 100 
°C or less. It is thermally unstable relative to hematite 
under most natural conditions, and decomposes on 
heating above about 350 °C. It is a common (and paleo- 
magnetically undesirable) secondary mineral in lime¬ 
stones and other sedimentary rocks. 
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5.3.4 Identification of ferrimagnetic minerals 

It is often difficult to identify the ferrimagnetic minerals 
in a rock, because their concentration is so low, especially 
in sedimentary rocks. If the rock is coarse grained, ferri¬ 
magnetic minerals may be identified optically among the 
opaque grains by studying polished sections in reflected 
light. However, in many rocks that are paleomagnetically 
important (e.g., basaltic lava, pelagic limestone) optical 
examination may be unable to resolve the very fine grain 
size of the ferrimagnetic mineral. The ferrimagnetic 
mineral fraction may then be identified by its properties 
of Curie temperature and coercivity. 

The Curie temperature is measured using a form of 
balance in which a strong magnetic field gradient exerts a 
force on the sample that is proportional to it s magnetization. 
The field (usually 0.4-1 T) is strong enough to saturate the 
magnetization of many minerals. The sample is heated and 
the change of magnetic force (i.e., sample magnetization) is 
observed with increasing temperature. When the Curie 
point is reached, the ferromagnetic behavior disappears; at 
higher temperatures the sample is paramagnetic. The Curie 
point is diagnostic of many minerals. For example, an 
extract of magnetic minerals from a pelagic limestone (Fig. 
5.16) shows the presence of goethite (—100 °C Curie point, 
sample SR3A), magnetite (—570 °C Curie point, all 
samples) and hematite (—650 °C Curie point, sample SRI 1). 
Some Curie balances are sensitive enough to analyze whole 
rock samples, but this is generally only possible in strongly 
magnetized igneous rocks. For most rocks it is necessary to 
extract the ferrimagnetic minerals, or to concentrate them. 
The extraction is sometimes difficult, and often it is not 
certain that the extract is representative of the rock as a 
whole. This is also a drawback of optical methods, which 
only allow description of large grains. 

To avoid the difficulties and uncertainties associated 
with making a special extract or concentrate of the ferri¬ 
magnetic fraction, alternative methods of ferrimagnetic 
mineral identification, based on bulk magnetic properties, 
are more widely used. One simple method makes use of 
the coercivities and Curie temperatures, as expressed in 
the thermal demagnetization of the isothermal remanent 
magnetization (see Section 5.3.6.4). Another method, 
useful for pure magnetite and hematite, takes advantage 
of the low-temperature variations of the magnetocrys- 
talline anisotropy constants of these minerals. 

5.3.5 Grain size dependence of ferrimagnetic properties 

The ferromagnetic properties of metals and ferrites vary 
sensitively with grain size. Consider an assemblage of uni¬ 
formly magnetized grains of a ferrimagnetic mineral char¬ 
acterized by a spontaneous magnetization M s and uniform 
grain volume v. Let the spontaneous magnetization be 
oriented parallel to an easy direction of magnetization 
(crystalline or shape determined), defined by the anisotropy 
energy K u per unit volume. The energy that keeps the 
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Fig. 5.16 Identification of the ferromagnetic minerals in a pelagic 
limestone by determination of their Curie temperatures in concentrated 
extracts (after Lowrie and Alvarez, 1975). 

magnetization parallel to the easy direction is equal to vK u . 
Thermal energy, proportional to the temperature, has the 
effect of disturbing this alignment. At temperature T the 
thermal energy of a grain is equal to kT, where k is 
Boltzmann’s constant (k= 1.381 X 10 - 23 J K -1 ). At any 
instant in time there is a chance that thermal energy will 
deflect the magnetic moment of a grain away from its easy 
direction. Progressively, the net magnetization of the mate¬ 
rial (the sum of all the magnetic moments of the numerous 
magnetic grains) will be randomized by the thermal energy, 
and the magnetization will be observed to decay. If the 
initial magnetization of the assemblage is M r0 , after time 1 
it will decrease exponentially to M r ( t), according to 

M r (0 = M r0 ex p(—|) (5.28) 

In this equation t is known as the relaxation time of 
the grain (Box 5.1). If the relaxation time is long, the 
exponential decrease in Eq. (5.28) is slow and the magne¬ 
tization is stable. The parameter t depends on properties 
of the grain and is given by the equation 

r = ^exp(^) (5.29) 

The constant v Q is related to the lattice vibrational fre¬ 
quency and has a very large value (=10 8 -10 10 s _1 ). The 
value of K u depends on whether the easy direction of the 
magnetic mineral is determined by the magnetocrystalline 
anisotropy or the magnetostatic (shape) anisotropy. For 
example, in hematite the magnetocrystalline anisotropy 
prevails because the spontaneous magnetization is 
very weak, and K u is equal to the magnetocrystalline 
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Box 5.1: Magnetic relaxation 


Relaxation behavior is characterized by the exponential 
return of a physical property with time from a state of 
elevated energy to a state of lower energy. The most 
familiar example is radioactive decay (Section 4.1.3.1), 
but the magnetizations of natural materials also exhibit 
relaxation behavior, as explained in Section 5.3.5. 

In the absence of an external field, the easy direction 
of magnetization of a single domain magnetic grain 
with volume v and anisotropy energy K u per unit volume 
is determined by the anisotropy energy vK u (Section 
5.2.7). The probability that the grain’s thermal energy 
kT can overcome this energy barrier and allow the grain 
magnetization to change direction is determined by the 
Maxwell-Boltzmann distribution of energies, and is 
proportional to exp(-vKJkT). The probability per unit 
time. A, of a magnetic moment changing to a different 
easy axis is given by the Arrhenius equation for a ther¬ 
mally activated process 

A = Cexp(-^) (1) 

The parameter C is called the frequency factor; here it is 
the lattice vibration frequency, v o . At a particular tem¬ 
perature, all parameters on the right side of Eq. (1) are 
constant and so the probability A per unit time of a 
magnetic moment changing to a different easy axis is 
constant. 

Now suppose an assemblage of identical non¬ 
interacting single domain particles, of which A, are 
magnetized initially in one direction (state 1) and A, in 
the opposite direction (state 2). The net magnetization is 
proportional to (A, - A 2 ). Assuming a constant proba¬ 


bility A per unit time, the number of particles dN x that 
change from state 1 to state 2 is proportional to the time 
interval dt and to the number of grains Aj in state 1. It is 
given by 

dA, = — A A, dt (2) 

where the negative sign indicates a reduction in N v 
Similarly, the number changing in the opposite sense 


from state 2 to state 1 is 

dA,= - AA 2 dr (3) 

The net change in magnetization is therefore 

dM = /idN l - pdN 2 = /xd(Aj — A,) (4) 

dM= -A/x(Aj - A 2 )dr= -XMdt (5) 

The solution of this differential equation is 

M= M 0 exp( — At) = M 0 exp(— (6) 

where t, the relaxation time, is the inverse of A in Eq. (1): 

T = ^o exp (^) (7) 


Equations (6) and (7) are very important in paleomag- 
netism. The strong anisotropy of fine grained ferromag¬ 
netic minerals can result in very long relaxation times, 
and consequently the magnetizations of fine grained 
rocks can be extremely stable over geological lengths of 
time. 


anisotropy. In grains of magnetite the value of K u is equal to 
((K 1 B)+ (K 2 1 27)), if magnetocrystalline anisotropy con¬ 
trols the magnetization (as in an equidimensional grain). If 
the magnetite grain is elongate with demagnetizing factors 
Aj and A 2 , shape anisotropy determines K u , which is then 
equal to the energy density A a given by Eq. (5.26). 

This theory applies only to very small grains that are 
uniformly magnetized. Fine grained ferrimagnetic miner¬ 
als are, however, very important in paleomagnetism and 
rock magnetism. The very finest grains, smaller than a crit¬ 
ical size, exhibit an unstable type of magnetic behavior 
called superparamagnetism, with relaxation times typically 
less than 100 s. Above the critical size the uniformly magne¬ 
tized grain is very stable and is called a single domain grain. 

5.3.5.1 Superparamagnetism 

In a ferromagnetic material the strong molecular fields 
keep the atomic spin magnetic moments uniformly aligned 


with each other, and the grain anisotropy requires this 
spontaneous magnetization to lie parallel to an “easy” 
direction. If the temperature is too high, thermal energy 
(kT) may exceed the anisotropy energy (vKJ but still be 
too small to break up the spontaneous magnetization. 
The thermal energy causes the entire magnetic moment of 
the grain to fluctuate coherently in a manner similar to 
paramagnetism (the theory of which applies to individual 
atomic magnetic moments). The grain magnetization 
has no stable direction, and the behavior is said to be 
superparamagnetic. It is important to note that superpara- 
magnetic grains themselves are immobile; only their 
uniform magnetization fluctuates relative to the grain. 
Whether the ferrimagnetic grain exists in a stable or super- 
paramagnetic state depends on the grain size, the grain 
shape (if the origin of K u is magnetostatic) and the tem¬ 
perature. If the grain volume v is very small, unstable 
magnetic behavior due to superparamagnetism becomes 
likely. Magnetite and hematite grains finer than about 
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Fig. 5.17 Ranges of grain sizes and shapes for superparamagnetic, 
single domain and multidomain magnetic behavior in ellipsoidal 
magnetite grains (after Evans and McElhinny, 1969). 

0.03 ptm in diameter are superparamagnetic at room temp¬ 
erature. 


53.5.2 Single domain particles 

When the anisotropic magnetic energy (vKJ of a grain is 
greater than the thermal energy (kT), the spontaneous 
magnetization direction favors one of the easy directions. 
The entire grain is uniformly magnetized as a single 
domain. This situation occurs in very fine grains of ferri- 
magnetic minerals. 

In magnetite K u is the magnetostatic energy related to 
the particle shape. The theoretical range of single domain 
sizes in magnetite is narrow, from about 0.03 to 0.1 |xm in 
equant grains and up to about 1 p,m in elongate grains 
(Fig. 5.17). In hematite K u is the large magnetocrystalline 
anisotropy energy, and the range of single domain sizes is 
larger, from about 0.03 to 15 |xm. 

The magnetization of a single domain particle is very 
stable, because to change it requires rotating the entire 
uniform spontaneous magnetization of the grain against 
the grain anisotropy, which requires a very strong magnetic 
field. The magnetic field required to reverse the direction of 
magnetization of a single domain grain is called its coerciv- 
ity B c and is given by: 



The maximum coercivity of single domain magnetite is 
around 0.3 T for needle-shaped elongate grains. The mag¬ 
netocrystalline anisotropy of hematite gives it higher 
maximum coercivities, in excess of 0.5 T. However, the 
magnetic properties of hematite are very variable and its 
maximum coercivity commonly exceeds 2T. Because of 



Fig. 5.18 Subdivision of (a) the uniform magnetization of a large grain 
into (b) two oppositely magnetized magnetic domains and (c) four 
alternately magnetized domains. 

their stable remanent magnetizations, single domain par¬ 
ticles play a very important role in paleomagnetism. 

53.5.3 Multidomain particles 

Single domain behavior is restricted to a limited range of 
grain sizes. When a grain is large enough, the magnetic 
energy associated with its magnetization becomes too 
large for the magnetization to remain uniform. This is 
because the demagnetizing field of a uniformly magne¬ 
tized grain (Fig. 5.18a) interacts with the spontaneous 
magnetization and generates a magnetostatic (or self- 
demagnetizing) energy. To reduce this energy, the magne¬ 
tization subdivides into smaller, uniformly magnetized 
units, called Weiss domains after P. Weiss, who theoreti¬ 
cally predicted domain structure in 1907. In the simplest 
case the magnetization divides into two, oppositely mag¬ 
netized domains (Fig. 5.18b). The net magnetization is 
reduced to zero, and the magnetostatic energy is reduced 
by about a half. Further subdivision (Fig. 5.18c) reduces 
the magnetostatic energy correspondingly. In a grain with 
n domains of alternately opposed spontaneous magneti¬ 
zations the magnetostatic energy is reduced by l/«. The 
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domains are separated from one another by thin regions, 
about 0.1 p.m thick, that are usually much thinner than the 
domains they divide. These regions are called Bloch 
domain walls in recognition of F. Bloch, who in 1932 pro¬ 
posed a theory for the structure of the domain wall on an 
atomic scale. Within the domain wall the magnetization 
undergoes small progressive changes in direction from 
each atom to its neighbor. The crystalline magnetic 
anisotropy of the material attempts to keep the atomic 
magnetic spins parallel to favored crystalline directions, 
while the exchange energy resists any change of direction 
from parallel alignment by the molecular field. The energy 
of these competing effects is expressed as a domain wall 
energy associated with each unit area of the wall. 

The magnetization of a multidomain grain can be 
changed by moving the position of a domain wall, which 
causes some domains to increase in size and others to 
decrease. A large multidomain grain may contain many 
easily movable domain walls. Consequently, it is much 
easier to change the magnetization of a multidomain 
grain than of a single domain grain. As a result multido¬ 
main grains are less stable carriers of remanent magneti¬ 
zation than single domain grains. 

The transition between single domain and multido¬ 
main behavior occurs when the reduction of magnetosta¬ 
tic energy is balanced by the energy associated with the 
domain wall that has been added. If magnetite grains are 
elongate, they can persist as single domain grains up to 
about 1 |xm (Fig. 5.17). Magnetite grains larger than a 
few micrometers in diameter are probably multidomain. 
In equidimensional grains of magnetite the transition 
should occur in grains of about 0.05-0.1 pun diameter. 
However, at this point the grain is not physically large 
enough to contain a wall, which has a thickness of about 
0.1 pm. Grains in the intermediate range of sizes, and 
those large enough to contain only a few walls, are said to 
carry a pseudo-single domain magnetization. True mul¬ 
tidomain behavior in magnetite is observed when the 
grain size exceeds 15-20 pm. 

In a pseudo-single domain grain that is large enough 
to contain only two domains, the domain wall separating 
them is not able to move freely. Its freedom of movement 
is restricted by interactions with the grain surface. A small 
grain in this size range has more stable magnetic proper¬ 
ties than a multidomain particle but is not as stable as a 
true single domain grain. Magnetite grains between about 
0.1 m and several micrometers in diameter have pseudo¬ 
single domain properties. 

5.3.6 Remanent magnetizations in rocks 

The small concentration of ferrimagnetic minerals in a 
rock gives it the properties of magnetic hysteresis. Most 
important of these is the ability to acquire a remanent 
magnetization (or remanence). The untreated remanence 
of a rock is called its natural remanent magnetization 
(NRM). It may be made up of several components 


acquired in different ways and at different times. The geo¬ 
logically important types of remanence are acquired at 
known times in the rock’s history, such as at the time of its 
formation or subsequent alteration. The remanence of a 
rock can be very stable against change; the high coercivity 
(especially of the fine grains) of the ferrimagnetic mineral 
assures preservation of the magnetic signal during long 
geological epochs. 

A remanence acquired at or close to the time of for¬ 
mation of the rock is called a primary magnetization; 
a remanence acquired at a later time is called a 
secondary magnetization. Examples of primary rema- 
nence are thermoremanent magnetization, which an 
igneous rock acquires during cooling, and the remanent 
magnetizations acquired by a sediment during or soon 
after deposition. Secondary remanences may be caused 
by chemical change of the rock during diagenesis or 
weathering, or by sampling and laboratory procedures. 

5.3.6.1 Thermoremanent magnetization 

The most important type of remanent magnetization in 
igneous (and high-grade metamorphic) rocks is thermore¬ 
manent magnetization (TRM). Igneous rocks solidify at 
temperatures well above 1000 °C. At this temperature the 
grains are solid and fixed in a rigid matrix. The grains of a 
ferrimagnetic mineral are well above their Curie tempera¬ 
ture, which in magnetite is 578 °C and in hematite is 
675 °C. There is no molecular field and the individual 
atomic magnetic moments are free to fluctuate chaoti¬ 
cally; the magnetization is paramagnetic (Fig. 5.19). 

As the rock cools, the temperature eventually passes 
below the Curie temperature of the ferrimagnetic grains 
and a spontaneous magnetization appears. In single 
domain grains the relaxation time of the grain magnetiza¬ 
tion is governed by Eq. (5.29), which can be modified by 
writing the anisotropy energy density K u in terms of the 
spontaneous magnetization M s and coercivity B c of the 
grain. This gives the relaxation time as follows 

_J_ ( vM s B c \ 

T "o eXp \ 2kT ) (5 - 1) 

At high temperature the thermal energy ( kT) is larger 
than the magnetic energy (vM s BJ2) and the magnetiza¬ 
tion is unstable. Although the individual atomic mag¬ 
netic moments are forced by the molecular field to act as 
coherent units, the grain magnetizations are superpara- 
magnetic. As the rock cools further, the spontaneous 
magnetization and the magnetic anisotropy energy K u of 
the grain increase. Eventually the temperature passes 
below a value at which the thermal energy, whose effect 
is to randomize the grain magnetic moments, is no 
longer greater than the magnetic anisotropy energy. The 
spontaneous magnetization then becomes “blocked” 
along an easy direction of magnetization of the grain. In 
the absence of an external magnetic field the grain 
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Fig. 5.19 On cooling through the Curie temperature the magnetic 
state of magnetite grains changes from paramagnetism to 
ferromagnetism. On cooling further the magnetizations in the 
magnetite grains become blocked along easy directions of 
magnetization close to the field direction. The resultant 
thermoremanent magnetization is parallel to the field direction. 


magnetic moments will be randomly oriented (assuming 
the easy axes to be randomly distributed). If the grain 
cools below its blocking temperature in a magnetic field, 
the grain magnetic moment is ‘‘blocked” along the easy 
axis that is closest to the direction of the field at that time 
(Fig. 5.19). The alignment of grain magnetic moments 
with the field is neither perfect nor complete; it repre¬ 
sents a statistical preference. This means that in an 
assemblage of grains more of the grains have their mag¬ 
netic moments aligned close to the field direction than 
any other direction. The degree of alignment depends on 
the strength of the field. 

It is important to note that the mineral grains them¬ 
selves are immobile throughout the process of acquisition 
of TRM. Only the internal magnetizations of the grains 
can change direction and eventually become blocked. The 
blocking temperatures of TRM are dependent on the 
grain size, grain shape, spontaneous magnetization and 
magnetic anisotropy of the ferrimagnetic mineral. If the 
rock contains a wide range of grain sizes and perhaps 
more than a single magnetic mineral, there may be a 
broad spectrum of blocking temperatures. Maximum 
blocking temperatures may range as high as the Curie 
point, and the spectrum can extend to below ambient 
temperature. If the magnetic field is applied only while the 
rock is cooling through a limited temperature range, only 
grains with blocking temperatures in this range are acti¬ 
vated, and a partial TRM (or pTRM) results. 


Mt. Etna lavas 

(Chevalier, 1925) 

360°/0° 



■ direction of magnetic field during eruption 
O direction of magnetization of lava sample 

Fig. 5.20 Agreement of directions of thermoremanent magnetization 
in a basaltic lava flow on Mt. Etna (Sicily) with the direction of the 
geomagnetic field during eruption of the lava (based upon data from 
Chevallier, 1925). 

TRM is a very stable magnetization which can exist 
unchanged for long intervals of geological time. The 
ability of TRM to record accurately the field direction is 
demonstrated by the results from a lava that erupted on 
Mt. Etna at a time for which a record of the magnetic field 
direction is available from observatory data. The direc¬ 
tions of the TRM in the lava samples are the same as the 
direction of the ambient field (Fig. 5.20). 

53.6.2 Sedimentary remanent magnetizations 

The acquisition of depositional remanent magnetization 
(DRM) during deposition of a sediment takes place at 
constant temperature. Magnetic and mechanical forces 
compete to produce a physical alignment of detrital ferri¬ 
magnetic particles. During settling through still water 
these particles are oriented by the ambient magnetic field 
in the same way that it orients a compass needle. The par¬ 
ticles become aligned statistically with the Earth’s mag¬ 
netic field (Fig. 5.21). The action of mechanical forces 
may at times spoil this alignment. Water currents cause 
hydromechanical forces that disturb the alignment during 
settling, giving rise to a declination error. On contact with 
the bottom of the sedimentary basin, the mechanical 
force of gravity rolls the particle into a stable attitude, 
causing an inclination error. The pressure of overlying 
sediment during deep burial results in compaction, which 
can produce further directional errors. The DRM is 
finally fixed in sedimentary rocks during diagenesis. 

A modified form of post-depositional remanence 
(pDRM) is important in fine grained sediments. A water¬ 
logged slurry forms at the sediment-water interface. Fine 
grained magnetic minerals in the water-filled pore spaces in 
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Fig. 5.21 Acquisition of depositional remanent magnetization (DRM) 
in a sediment; gravity causes an inclination error between the 
magnetization and field directions. 

the sediment are partially in suspension and may be reori¬ 
ented by the magnetic field, if they are free enough to move 
(Fig. 5.22a). The energy for this motion of the particles is 
obtained from the Brownian motion of the water mole¬ 
cules, which continuously and randomly collide with the 
particles in the pore spaces. Large particles are probably in 
contact with the surrounding grains and are unaffected by 
collisions with the water molecules. Very fine particles that 
are virtually floating in the pore spaces may acquire 
enough freedom from the Brownian agitation to align sta¬ 
tistically with the Earth’s magnetic field. Laboratory exper¬ 
iments have established that the direction of pDRM is an 
accurate record of the depositional field, without inclina¬ 
tion error (Fig. 5.22b). The pDRM is acquired later than 
the actual time of sedimentation, and is fixed in the sedi¬ 
ment during compaction and de-watering at a depth of 
— 10 cm. This may represent a lock-in time delay of lOOyr 
in lacustrine sediments or 10,000 yr in pelagic marine sedi¬ 
ments, where it is no more important geologically than the 
errors involved in locating paleontological stage bound¬ 
aries. The pDRM process is particularly effective in fine 
grained sediments containing strongly magnetic magnetite 
grains. For example, pDRM is the most important mecha¬ 
nism of primary magnetization in pelagic limestones. 
Compaction may cause a flattening of the inclination 
under some conditions. 

Bioturbation may mix the sediment, typically to a depth 
of about 10 cm in pelagic sediments. This affects the posi¬ 
tions of stratigraphic marker levels. First occurrence datum 
levels of fossils are carried deeper, and last occurrences are 
carried higher than the true stratigraphic levels. Agitation 




Inclination of field 

Fig. 5.22 (a) Post-depositional remanent magnetization (pDRM) is 
acquired by reorientation of ferromagnetic grains in the pore spaces of 
a deposited sediment, (b) Comparison of the pDRM inclination with the 
field inclination in a redeposited deep-sea sediment (after Irving and 
Major, 1964). 

of bioturbated sediment by the burrowing organisms 
assists the Brownian motion of the magnetic particles. 
Under these conditions the pDRM is acquired at the base 
of the bioturbated zone. 

53.6.3 Chemical remanent magnetization 

Chemical remanent magnetization (CRM) is usually a 
secondary form of remanence in a rock. It occurs when 
the magnetic minerals in a rock suffer chemical alteration 
or when new minerals form authigenically. An example is 
the precipitation of hematite from a goethite precursor or 
from iron-saturated fluids that pass through the rock. The 
magnetic minerals may also experience diagenetic modifi¬ 
cation or oxidation by weathering, which usually happens 
on the grain surface and along cracks (Fig. 5.23). The 
growth of a new mineral (or the alteration of an existing 
one) involves changes in grain volume v, spontaneous 




302 Geomagnetism and paleomagnetism 


hematite 



Fig. 5.23 Acquisition of chemical remanent magnetization (CRM) 
accompanies the diagenetic modification or oxidation by weathering of 
magnetic minerals; this often happens on the grain surface and along 
cracks. 

magnetization M s and coercivity B c . The chemical change 
affects the relaxation time of the grain magnetization, 
according to Eq. (5.31). The grains eventually grow 
through a critical volume, at which the grain magnetiza¬ 
tion becomes blocked. The new CRM is acquired in the 
direction of the ambient field during the chemical change, 
and so it is younger than the host rock. It has stable mag¬ 
netic properties similar to TRM. A common example is 
the formation of hematite during diagenesis or weather¬ 
ing. Hematite that originates in this way typically carries a 
secondary remanent magnetization. 

53.6.4 Isothermal remanent magnetization 

Isothermal remanent magnetization (IRM) is induced in 
a rock sample by placing it in a magnetic field at constant 
temperature. For example, rock samples are exposed to 
the magnetic fields of the sampling equipment and to 
other magnetic fields during transport to the laboratory. 
A common technique of rock magnetic analysis consists 
of deliberately inducing IRM via a known magnetic field 
produced in a large coil or between the poles of an elec¬ 
tromagnet. The magnetic moments within each grain are 
partially aligned by the applied field. The degree of align¬ 
ment depends on the field strength and on the resistance 
of the magnetic mineral to being magnetized, loosely 
referred to as its coercivity. After removing the sample 
from the applied field an IRM remains in the rock (see 
Fig. 5.11). If the rock sample is placed in progressively 
stronger fields the IRM increases to a maximum value 
called the saturation IRM, which is determined by the 
type and concentration of the magnetic mineral. The 
shape of the progressive acquisition curve and the field 
needed to reach saturation IRM depend on the coercivi- 
ties of the magnetic minerals in the rock (Fig. 5.24). 

The maximum coercivities of the most common ferro¬ 
magnetic minerals in rocks are fairly well known (Table 
5.1). These minerals also have distinctive maximum block¬ 
ing temperatures (Section 5.3.6.1). The combination of 


Table 5.1 Maximum coercivities and blocking 
temperatures for some common ferromagnetic minerals 


Ferromagnetic 

mineral 

Maximum 

coercivity 

[T] 

Maximum 

blocking 

temperature 

[°C] 

Magnetite 

0.3 

575 

Maghemite 

0.3 

—350 

Titanomagnetite (Fe 3 
x = 0.3 

TiA): 

0.2 

350 

x = 0.6 

0.1 

150 

Pyrrhotite 

0.5-1 

325 

Hematite 

1.5-5 

675 

Goethite 

>5 

80-120 




Fig. 5.24 Examples of the identification of magnetic minerals by 
acquisition and subsequent thermal demagnetization of IRM. Hematite 
is present in both (a) and (b), because saturation IRM requires fields 
> 1 T and thermal demagnetization of the hard fraction persists to 
T~ 675 °C. In (a) the soft fraction that demagnetizes at T= 575 °C is 
magnetite, while in (b) no magnetite is indicated but pyrrhotite is 
present in all three fractions, shown by thermal unblocking at 
T = 300-330 °C (after Lowrie, 1990). 

these properties provides a method of identification of the 
predominant magnetic minerals in rocks. Starting with the 
strongest field available, IRM is imparted in successively 
smaller fields, chosen to remagnetize different coercivity 
fractions, along two or three orthogonal directions. The 
compound IRM is then subjected to progressive thermal 
demagnetization. The demagnetization characteristics of 
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Fig. 5.25 Viscous remanent magnetization (VRM) in pelagic limestone 
samples, showing logarithmic growth with increasing time (after Lowrie 
and Heller, 1982). 

the different coercivity fractions help to identify the mag¬ 
netic mineralogy (Fig. 5.24). 

53.6.5 Other remanent magnetizations 

If a rock containing magnetic minerals with unstable mag¬ 
netic moments experiences a magnetic field, there is a finite 
probability that some magnetic moments opposite to the 
field may switch direction to be parallel to the field. As time 
goes on, the number of magnetic moments in the direction 
of the field increases and the magnetization grows loga¬ 
rithmically with time (Fig. 5.25). The time-dependent 
remanence acquired in this way is called a viscous remanent 
magnetization (VRM). The VRM also decreases logarith¬ 
mically when the field is removed or changed and is often 
identifiable during laboratory analysis as an unstable time- 
dependent change in remanence. The direction of a VRM 
is often parallel to the present-day field direction, which 
can be useful in its identification. However, it is always a 
secondary remanence and it can mask the presence of geo¬ 
logically interesting stable components. Techniques of pro¬ 
gressive demagnetization have been designed to remove 
VRM and IRM, effectively “cleaning" the remanence of a 
rock of undesirable components. 

An important form of remanent magnetization can be 
produced in a rock sample by placing it in a coil that carries 
an alternating magnetic field, whose amplitude is then 
slowly reduced to zero. In the absence of another field, this 
procedure randomizes the orientations of grain magnetic 
moments with coercivities less than the peak field. If, 
however, the rock sample is exposed to a small constant 
magnetic field while the amplitude of the alternating mag¬ 
netic field is decreasing to zero, the magnetic moments are 
not randomized. Their distribution is biassed with a statis¬ 
tical preference for the direction of the constant field. This 
produces an anhysteretic remanent magnetization (ARM) 
in the sample. The intensity of ARM increases with the 
amplitude of the alternating field, and also with the 


strength of the constant bias field. ARM may be produced 
deliberately, as described, and it is commonly observed as a 
spurious effect during progressive alternating field demag¬ 
netization of rock samples when the shielding from exter¬ 
nal fields is imperfect (Section 5.6.3.2). 

Stress and the associated strains of magnetostriction 
combine to produce magnetoelastic energy (see Section 
5.2.7.3). This acts as a source of magnetic anisotropy that 
can modify the easy directions of magnetization in some 
magnetic minerals. As a result, stress caused by tectonic 
deformation or by defects in the crystal structure of con¬ 
stituent minerals may influence the direction of magnetiza¬ 
tion in a rock. The effect of high hydrostatic pressure, such 
as might be encountered at depth in the Earth’s crust, is to 
deflect the magnetizations of individual grains away from 
existing easy axes, which leads to a net demagnetization of 
the rock. On the other hand, the effect of non-hydrostatic 
stress in the presence of a magnetic field is to produce a 
piezoremanent magnetization (PRM) that is added to, or 
may replace, any pre-existing remanent magnetization. The 
modification of rock magnetization by deformational 
stress, whether elastic or plastic, can overprint the original 
magnetization in rocks that have been deeply buried, or sub¬ 
jected to high stress and temperature during deformation. 

A related type of magnetization has been observed in 
terrestrial and lunar rocks that have been shocked by mete- 
oritic impact. The event causes very high local stress of 
short duration, and gives rise to a shock remanent magneti¬ 
zation (SRM) by a similar mechanism to PRM. The colli¬ 
sion of a meteorite with another planetary body imposes a 
characteristic shock texture on the impacted rocks, so that 
SRM and PRM are not exactly equivalent. Moreover, much 
of the energy of the collision is released as heat, which can 
modify and reduce the SRM over a much longer time than 
the brief duration of the impact. However, SRM may have 
played a role in the origin of anomalous crustal magnetiza¬ 
tions on the Moon, Mars and other extraterrestrial bodies 
that have suffered extensive meteoritic bombardment. 

5.3.7 Environmental magnetism 

Since the mid-1980s rock magnetic properties have found 
important new applications in environmental research, 
where they can serve as tracers for pollution or as indica¬ 
tors of past climates. Heavy metals from industrial 
processes enter the environment, contaminating both the 
ground surface and the water in lakes, rivers and the 
groundwater. Magnetic minerals such as magnetite and 
hematite accompany more toxic elements in the emis¬ 
sions. Magnetic susceptibility surveys can provide a rapid 
method of determining the geographic dispersal and rela¬ 
tive concentration of the heavy metal pollution. Motor- 
vehicle exhaust emissions also pollute the environment by 
loading it with heavy metals as well as sub-micrometer 
sized particles of soot, a fraction of which consists of 
nanoparticle sized magnetic minerals. Rock magnetic 
parameters and investigative techniques provide ways of 
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Fig. 5.26 Comparison of the 
magnetic susceptibility 
variation in a section of loess 
and paleosols atXifeng, 
China, with the oxygen 
isotope record in marine 
sediments cored at ODP site 
677 (after Evans and Heller, 
2003). 
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characterizing these particles and of describing their 
regional distribution and concentration. 

The magnetic grain fraction in a sediment or soil usually 
consists of hematite, magnetite, maghemite or an iron 
sulfide. The magnetic mineral composition may be modi¬ 
fied by the climatic conditions at or after deposition. These 
affect magnetic properties of the sediment that depend on 
grain size and mineral composition, such as magnetic sus¬ 
ceptibility, coercivity and the various remanent magnetiza¬ 
tions. These magnetic parameters can act as proxies for 
paleoclimatic change. Numerous studies have been devoted 
to understanding the processes involved and their applica¬ 
tion. The following example illustrates the potential of 
magnetic methods for analyzing a paleoclimatic problem. 

Great thicknesses of loess sediments up to 100 m in 
thickness occur in Central China. The loess are very fine 
grained (10-50 pm size) wind-blown sediments that are 
deposited in cold, dry conditions. Alternating with the 
beds of loess are layers of paleosol, the name given to 
fossil soils. These form by conversion of the loess to soils 
(a process called pedogenesis), which takes place during 
interglacial periods of warmer, humid conditions. In 
turn, later loess deposits bury the soils. The alternation of 
loess and paleosol is therefore a record of past climatic 
variations. The magnetic susceptibility correlates with the 
lithology of the loess-paleosol sequences. In the loess 
beds the susceptibility measures around 25 SI units, but in 
the paleosol layers the values exceed 200 SI units (Fig. 
5.26). Rock magnetic analysis showed that the dominant 
magnetic mineral in both lithologies is magnetite. The 
higher susceptibility of the paleosols is thus due to higher 
concentrations of magnetite. The susceptibility variation 
correlates very well with the oxygen isotope stratigraphy 
measured in deep-sea sediments at site 677 of the Ocean 
Drilling Program (ODP). The oxygen isotope ratio (see 
Box 5.2) is a record of climatic variation, with the most 
negative values corresponding to warmer temperature, 


and it has been dated by comparison to magnetostratigra- 
phy in marine sediments (Section 5.7.2). The correlation 
in Fig. 5.26 provides a way of dating the loess-paleosol 
sequence and shows that the paleosols were formed under 
warmer conditions than the loess. Possibly the change to 
a warmer, humid climate encouraged the in situ formation 
of a new phase of magnetite in the paleosols, with a con¬ 
sequent increase in susceptibility. 

The new phase of magnetite in the paleosols originates 
by a chemical process. However, magnetite can be produced 
by the action of bacteria in sediments deposited in lakes 
and in the sea. This is referred to as a biogenic process and 
the bacteria that can produce magnetite are called magneto¬ 
tactic bacteria. The magnetite forms as tiny crystallites 
called magnetosomes, less than 0.1 |xm in size and thus of 
single domain type, which occur as chains of particles 
enclosed in a membrane. The most common magnetosome 
mineral is magnetite, but greigite, an iron sulfide with struc¬ 
ture similar to magnetite, has also been found. The chain¬ 
like assembly imparts a dipole magnetic moment to the 
bacteria. This is evidently an evolutionary feature, which 
helps the bacteria to survive. If the sediment is disturbed, 
the magnetic moment of the bacteria enables them to move 
along the Earth's magnetic field lines. The dipping field 
lines guide them back down into the sediment where they 
find nutrition needed for their survival. It was long believed 
that magnetite in deep-sea sediments was mainly of detrital 
origin, washed in from the continents and ocean ridges, but 
it is now clear that a large portion must be biomagnetic. 

Magnetite of biogenic origin has been identified in many 
unusual settings. For example, submicroscopic magnetite 
occurs in the brains of dolphins and birds; whether it plays a 
role in their guidance during migration is unclear. 
Magnetite of nanometer size has been located in the human 
brain, where it may be related to neurological disorders such 
as epilepsy, Alzheimer's disease and Parkinson's disease. It 
also occurs in other human organs. The magnetic properties 
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Box 5.2: Oxygen isotope ratio 


The element oxygen has two important stable isotopes. 
These are a “light” isotope 16 0, with 8 protons and 8 
neutrons in its nucleus, and a “heavy” isotope 18 0, with 
8 protons and 10 neutrons. Mass spectrometers are 
capable of determining accurately the masses of these 
isotopes, and so the mass ratio 18 0/ 16 0 can be measured 
in quite small samples. In order to describe the deviation 
of the 18 0/ 16 0 ratio in a given sample from a standard 
value, a useful parameter, 8 ls O, is defined as follows: 

8 1S 0 _ 1000 x ( ( 18 Q /16 0)sample ~ ( 18 Q/ 16 0) stand ard \ 

v ( 18 o/-o) standard J 

The standard value is taken to be the 18 0/ 16 0 ratio in 
the modern oceans at depths in the 200-500 m range, or 
alternatively the ratio in a fossil belemnite known as 
PDB. The factor 1000 causes 8 18 0 values to be 
expressed in parts per thousand. The global climate at 
present is relatively warmer than in the past, so 18 0/ 16 0 
ratios measured in ancient samples tend to be smaller 
than the standard value and give negative 8 18 0 values. 

The 18 0/ 16 0 ratio in water is dependent on tempera¬ 
ture, as evident in the plot of 8 ls O in the annual precipi¬ 
tation against the mean temperature at a global set of 
sites (Fig. B5.2). Two factors determine this correlation: 
cool air holds less moisture than warm air, and the 
“heavy” isotope ls O condenses more easily than the 
“light” isotope 16 0. From an air mass that moves pole- 
wards, precipitation in warmer regions is relatively rich 
in 18 0 whereas in colder regions it is enriched in 16 0. 

The situation is reversed in the oceans. During global 
warm periods, polar ice sheets melt, adding fresh water 
enriched in 16 0 to the oceans. As a result, low (i.e., large 
negative) 8 ls O values in the oceans indicate intervals of 
global warming. In contrast, during glacial intervals 
water is transferred from the oceans to the ice sheets 



Fig. B5.2 Observed correlation between d 18 0 in precipitation and 
temperature for the present-day climate (after Jouzel eta/., 1994). 

primarily as 16 0, so the remaining ocean water is 
enriched in 18 0, which increases the 8 18 0 value. Thus, 
high (i.e., small negative or even positive) 8 ls O values in 
oceanic water and sediments indicate cool intervals. 
Conversely, the same conditions result in the opposite 
relationships between global temperature and S 18 0 
values measured in polar ice. 

Paleoclimatology uses the 8 ls O parameter as a guide 
to past climates, as illustrated by the use of the record in 
sediment cores from the Ocean Drilling Project to inter¬ 
pret the paleoclimate that affected Chinese loess-pale- 
osol profiles (Fig. 5.26). Samples can be used from a 
variety of sources: polar ice cores, ocean sediments, and 
fossil shells. In the last case, biological and chemical 
processes disturb the simple relationship with tempera¬ 
ture, but this can be taken into account and corrected. 


of the magnetite, its source and the mechanism of its forma¬ 
tion in human tissue are exciting fields of modern research. 

5.4 GEOMAGNETISM 

5.4.1 Introduction 

The magnetic field of the Earth is a vector, that is, it has 
both magnitude and direction. The magnitude, or intensity 
F, of the field is measured in the same units as other B- 
fields, namely in tesla (see Eq. (5.10)). However, a tesla is 
an extremely strong magnetic field, such as one would 
observe between the poles of a powerful electromagnet. 
The Earth’s magnetic field is much weaker; its maximum 
intensity is reached near to the magnetic poles, where it 
amounts to about 6X10 -5 T. Modern instruments for 


measuring magnetic fields (called magnetometers) have a 
sensitivity of about 10 -9 T; this unit is called a nanotesla 
(nT) and has been adopted in geophysics as the practical 
unit for expressing the intensity of geomagnetic field inten¬ 
sity. There is a practical reason for adopting this unit. Most 
geomagnetic surveys carried out until the 1970s used the 
now abandoned c.g.s. system of units, in which the B-field 
was measured in gauss, equivalent to 10 -4 T. The practical 
unit of geophysical exploration was then 10 -5 gauss, called 
a gamma (y). Thus, the former unit (y) is conveniently 
equal to 10 -9 T, which is the new unit (nT). 

The magnetic vector can be expressed as Cartesian 
components parallel to any three orthogonal axes. The 
geomagnetic elements are taken to be components parallel 
to the geographic north and east directions and the verti¬ 
cally downward direction (Fig. 5.27). Alternatively, the 
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Fig. 5.27 Definition of the geomagnetic elements. The geomagnetic 
field can be described by north (X), east (VO and vertically downward (Z) 
Cartesian components, or by the angles of declination (D) and 
inclination (/) together with the total field intensity (F). 

geomagnetic elements can be expressed in spherical polar 
coordinates. The magnitude of the magnetic vector is 
given by the field strength F; its direction is specified by 
two angles. The declination D is the angle between the 
magnetic meridian and the geographic meridian; the incli¬ 
nation I is the angle at which the magnetic vector dips 
below the horizontal (Fig. 5.27). The Cartesian {X, Y, Z) 
and spherical polar ( F , D, I) sets of geomagnetic elements 
are related to each other as follows: 

X= FcosIcosD T=Fcos/sinD Z = Fsin/ 

F 2 = X 2 + Y 2 + Z 2 


This is a rather formidable expression, but fortunately 
the most useful terms are quite simple. The summation 
signs indicate that the total potential is made up of an 
infinite number of terms with different values of n and /. 
We will only pay attention here to the few terms for which 
n = 1. The expression in parentheses describes the varia¬ 
tion of the potential with distance r. For each value of n 
there will be different dependences (e.g., on r, r 2 , r 3 , r~ 2 , 
r -3 , etc.). The function I^(0,<)>) describes the variation of 
the potential when r is constant, i.e., on the surface of a 
sphere. It is called a spherical harmonic function , because 
it has the same value when 6 or <J> is increased by an inte¬ 
gral multiple of 2 tt (Box 2.3). For observations made on 
the spherical surface of the Earth, the constants A n 
describe parts of the potential that arise from magnetic 
field sources outside the Earth, which are called the geo¬ 
magnetic field of external origin. The constants B n 
describe contributions to the magnetic potential from 
sources inside the Earth. This part is called the geomag¬ 
netic field of internal origin. 

The potential itself is not measured directly. The geo¬ 
magnetic elements X, Y and Z (Fig. 5.27) are recorded at 
magnetic observatories. Ideally these should be distrib¬ 
uted uniformly over the Earth’s surface but in fact they are 
predominantly in the northern hemisphere. The geomag¬ 
netic field components are directional derivatives of the 
magnetic potential and depend on the same coefficients A n 
and B n . Observations of magnetic field elements at a large 
number of measurement stations with a world-wide distri¬ 
bution allows the relative importance of A n and B n to be 
assessed. From the sparse data-set available in 1838 Gauss 
was able to show that the coefficients A n are very much 
smaller than B n . He concluded that the field of external 
origin was insignificant and that the field of internal origin 
was predominantly that of a dipole. 


D = arctanl 


Y 

X 


I — arctan 


\sjX 2 + Y 2 


(5.32) 


5.4.2 Separation of the magnetic fields of external and 
internal origin 

The magnetic field B and its potential W at any point can be 
expressed in terms of the spherical polar coordinates (r, 6, (j>) 
of the point of observation. Gauss expressed the potential 
of the geomagnetic field as an infinite series of terms involv¬ 
ing these coordinates. Essentially, his method divides the 
field into separate components that decrease at different 
rates with increasing distance from the center of the Earth. 
The detailed analysis is complicated and beyond the scope 
of this text. The magnitude of the potential is given by 

W= R%A n r" + W) ( 5 - 33 > 

71=1 \ ' / 1 = 0 


5.4.3 The magnetic field of external origin 

The magnetic field of the Earth in space has been mea¬ 
sured from satellites and spacecraft. The external field has 
a quite complicated appearance (Fig. 5.28). It is strongly 
affected by the solar wind, a stream of electrically charged 
particles (consisting mainly of electrons, protons and 
helium nuclei) that is constantly emitted by the Sun. The 
solar wind is a plasma. This is the physical term for an 
ionized gas of low particle density made up of nearly 
equal concentrations of oppositely charged ions. At the 
distance of the Earth from the Sun (1 AU) the density of 
the solar wind is about 7 ions per cm 3 , and it produces a 
magnetic field of about 6nT. The solar wind interacts 
with the magnetic field of the Earth to form a region 
called the magnetosphere. At distances greater than a few 
Earth radii the interaction greatly alters the magnetic field 
from that of a simple dipole. 

The velocity of the solar wind relative to the Earth is 
about 450 km s _1 . At a great distance (about 15 Earth radii) 
from the Earth, on the day side, the supersonic solar wind 


where R is the Earth’s radius. 
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Fig. 5.28 Schematic cross-section through the magnetosphere, 
showing various regions of interaction of the Earth's magnetic field 
with the solar wind. 


collides with the thin upper atmosphere. This produces an 
effect similar to the build-up of a shock wave in front of a 
supersonic aircraft. The shock front is called the bow-shock 
region (Fig. 5.28); it marks the outer boundary of the mag¬ 
netosphere. Within the bow-shock region the solar wind is 
slowed down and heated up. After passing through the 
shock front the solar wind is diverted around the Earth in a 
region of turbulent motion called the magneto sheath. The 
moving charged particles of the solar wind constitute elec¬ 
trical currents. They produce an interplanetary magnetic 
field , which reinforces and compresses the geomagnetic 
field on the day side and weakens and stretches it out on the 
night side of the Earth. This results in a geomagnetic tail, 
or magnetotail, which extends to great distances “down¬ 
wind’" from the Earth. The Moon’s distance from the Earth 
is about 60 Earth radii and so its monthly orbit about the 
Earth brings it in and out of the magnetotail on each 
circuit. The transition between the deformed magnetic field 
and the magnetosheath is called the magnetopause. 

5.4.3.1 The Van Allen radiation belts 

Charged particles that penetrate the magnetopause are 
trapped by the geomagnetic field lines and form the Van 
Allen radiation belts. These constitute two doughnut¬ 
shaped regions coaxial with the geomagnetic axis (Fig. 
5.29). The inner belt contains mainly protons, the outer 
belt energetic electrons. Within each belt the charged par¬ 
ticles move in helical fashion around the geomagnetic field 
lines (Fig. 5.30). The pitch of the spiraling motion gets 
smaller as the particle comes ever closer to the Earth and 
the field intensity increases; eventually it reaches zero and 
reverses sense. This compels the particles to shuttle rapidly 
from one polar region to the other along the field lines. 
The inner Van Allen belt starts about 1000 km above the 
Earth’s surface and extends to an altitude of about 


3000km (Fig. 5.29); the outer belt occupies a doughnut 
shaped region at distances between about 3 and 4 Earth 
radii (20,000-30,000 km) from the center of the Earth. 

5.4.32 The ionosphere 

The effects described above illustrate how the Earth’s mag¬ 
netic field acts as a shield against much of the extra-terres¬ 
trial radiation. The atmosphere acts as a protective blanket 
against the remainder. Most of the very short-wavelength 
fraction of the solar radiation that penetrates the atmos¬ 
phere does not reach the Earth’s surface. Energetic y- and 
x-rays and ultraviolet radiation cause ionization of the 
molecules of nitrogen and oxygen in the thin upper atmos¬ 
phere at altitudes between about 50 km and 1500 km, 
forming an ionized region called the ionosphere. It is 
formed of five layers, labelled the D, E, Fj, F 2 and G layers 
from the base to the top. Each layer can reflect radio waves. 
The thicknesses and ionizations of the layers change 
during the course of a day; all but one or two layers on the 
night side of the Earth disappear while they thicken and 
strengthen on the day side (Fig. 5.31). A radio transmitter 
on the day side can bounce signals off the ionosphere that 
then travel around the world by multiple reflections 
between the ground surface and the ionosphere. Conse¬ 
quently, radio reception of distant stations from far across 
the globe is best during the local night hours. The D layer is 
closest to the Earth at an altitude of about 80-100 km. It 
was first discovered in 1902, before the nature of the ionos¬ 
phere was known, because of its ability to reflect long- 
wavelength radio waves, and is named the Kennelly- 
Heaviside layer in honor of its discoverers. The E layer is 
used by short-wave amateur radio enthusiasts. The F layers 
are the most intensely ionized. 

5.43.3 Diurnal variation and magnetic storms 

The ionized molecules in the ionosphere release swarms of 
electrons that form powerful, horizontal, ring-like electri¬ 
cal currents. These act as sources of external magnetic 
fields that are detected at the surface of the Earth. The 
ionization is most intense on the day side of the Earth, 
where extra layers develop. The Sun also causes atmos¬ 
pheric tides in the ionosphere, partly due to gravitational 
attraction but mainly because the side facing the Sun is 
heated up during the day. The motions of the charged par¬ 
ticles through the Earth’s magnetic field produce an elec¬ 
trical field, according to Lorentz’s law (Eq. (5.10)), which 
drives electrical currents in the ionosphere. In particular, 
the horizontal component of particle velocity interacts 
with the vertical component of the geomagnetic field to 
produce horizontal electrical current loops in the ionos¬ 
phere. These currents cause a magnetic field at the Earth’s 
surface. As the Earth rotates beneath the ionosphere the 
observed intensity of the geomagnetic field fluctuates 
with a range of amplitude of about 10-30nT at the 
Earth’s surface and a period of one day (Fig. 5.32a). This 
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Fig. 5.29 Schematic 
representation of the inner 
and outer Van Allen belts of 
charged particles trapped by 
the magnetic field of the 
Earth. 


dipole 




Fig. 5.30 Charged particles from the solar wind are constrained to 
move in a helical fashion about the geomagnetic field lines (after 
Vestine, 1962). 



Fig. 5.31 Cross-section through the Earth showing the layered 
structure of the ionosphere (after Strahler, 1963). 
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Fig. 5.32 (a) The time-dependent daily (or diurnal) variation of the 
components of geomagnetic field intensity at different latitudes (after 
Chapman and Bartels, 1940), and (b) the variation of horizontal field 
intensity during a magnetic storm (after Ondoh and Maeda, 1962). 


time-dependent change of geomagnetic field intensity is 
called the diurnal (or daily) variation. 

The magnitude of the diurnal variation depends on the 
latitude at which it is observed. Because it greatly exceeds 
the accuracy with which magnetic fields are measured 
during surveys, the diurnal variation must be compensated 
by correcting field measurements accordingly. The intensity 
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of the effect depends on the degree of ionization of the 
ionosphere and is therefore determined by the state of solar 
activity. As described in Section 5.4.7.1, the solar activity is 
not constant. On days when the activity of the Sun is espe¬ 
cially low, the diurnal variation is said to be of solar quiet 
(S ) type. On normal days, or when the solar activity is high, 
the S q variation is overlaid by a solar disturbance (S D ) vari¬ 
ation. Solar activity changes periodically with the 11-yr 
cycle of sunspots and solar flares. The enhanced emission 
of radiation associated with these solar phenomena 
increases the ionospheric currents. These give rise to rapidly 
varying, anomalously strong magnetic fields (called mag¬ 
netic storms) with amplitudes of up to 1000 nT at the 
Earth’s surface (Fig. 5.32b). The ionospheric disturbance 
also disrupts short-wave to long-wave radio transmissions. 
Magnetic surveying must be suspended temporarily while a 
magnetic storm is in progress, which can last for hours or 
days, depending on the duration of the solar activity. 

5.4.4 The magnetic field of internal origin 

To understand how geophysicists describe the geomagnetic 
field of internal origin mathematically we return to 
Eq. (5.33). First, we follow Gauss and omit the coefficients 
A of the external field. The spherical harmonic functions 
Y^{6,<J>) that describe the variation of potential on a spher¬ 
ical surface are then written in expanded form (Box 2.3). 
The potential W of the field of internal origin becomes 


n = o° m = n 




i+i 


(g"‘cos m<f) 

■ h% sin (cos 0) 


(5.34) 


Here, R is the Earth’s radius, as before, and R"'(cos0) are 
called Schmidt polynomials, which are related to the asso¬ 
ciated Legendre polynomials (Box 2.2). 

Equation (5.34) is a multipole expression of the geo¬ 
magnetic potential. It relates the potential of the mea¬ 
sured field to the potentials of particular combinations of 
magnetic poles (Box 5.3). The constants g'" and If" in the 
geomagnetic potential are called the Gauss (or Gauss- 
Schmidt) coefficients of order n and degree m. Inspection 
of Eq. (5.34) shows that they have the same dimensions 
(nT) as the B-field. Their values are computed from 
analysis of measurements of the geomagnetic field. 

Data from several sources are integrated in modern 
analyses of the Earth’s magnetic field. Until the dawn of 
the satellite era, continuous records at magnetic observa¬ 
tories were the principal sources of geomagnetic data. 
Average values of the geomagnetic elements were deter¬ 
mined, from which optimum values for the Gauss 
coefficients could be derived. Currently, about 200 perma¬ 
nent observatories make continuous measurements of the 
field. However, satellites orbiting the Earth in low near- 
polar orbits now provide most of the high-quality data 
used to model the field. The Polar Orbiting Geophysical 


Observatory (POGO), launched in 1965, was the first to 
deliver field measurements, but the greatest advance came 
with the Magnetic Field Satellite (MAGSAT), which 
delivered high-quality data during a six-month mission in 
1979-1980. In 1979, a Danish satellite, 0RSTED, was 
placed in an elliptical, low-polar orbit, with altitude 
between 650 km and 865 km. It carried a vector magne¬ 
tometer and a total field magnetometer, each with a sensi¬ 
tivity of 0.5 nT; the mission was dedicated to a survey of 
the geomagnetic field. The German satellite CHAMP (see 
also Section 2.4.6.4), was launched in 2000 in a lower, 
near-circular orbit on a planned five-year mission. In 
addition to measuring the gravity field, this satellite 
was also equipped to make scalar and vector measure¬ 
ments of the magnetic field. The low orbit and improved 
instrumentation were designed to provide magnetic mea¬ 
surements with high resolution. 

In principle, an infinite number of Gauss coefficients 
would be needed to define the field completely. The 
coefficients of order and degree 8 and higher are very small 
and the calculation of Gauss coefficients must usually be 
truncated. A global model of the field is provided by the 
International Geomagnetic Reference Field (IGRF), 
which is based on coefficients up to n = 10, although analy¬ 
ses of higher order have been made. It is updated at regular 
intervals. The IGRF also gives the rate of change of each of 
the Gauss coefficients (its secular variation), which permits 
correction of the current values between update years. 

The Gauss coefficients get smaller with increasing 
order /?; this decrease provides a way of estimating the 
origin of the internal field. The analysis involves a tech¬ 
nique called power spectral analysis. The distance across a 
feature of the magnetic field (for example, a region where 
the field is stronger than average) is called the wavelength 
of the feature. As in the case of gravity anomalies, deep- 
seated magnetic sources produce broad (long-wave- 
length) magnetic anomalies, while shallow sources result 
in narrow (short-wavelength) anomalies. Spectral analysis 
consists of calculating the power (alternatively called 
the energy density) associated with each “frequency” in 
the signal. This is obtained by computing the sum of the 
squares of all coefficients with the same order. In the case 
of the geomagnetic field, the spectral analysis is based on 
the values of the Gauss coefficients. The spatial frequency 
of any part of the observed field is contained in the order 
n of the coefficients. Low-order terms (those with small 
values of n) correspond to long-wavelength features, 
high-order terms are related to short-wavelength features. 

Measurements of the geomagnetic field from the 
MAGSAT Earth-orbiting satellite at a mean altitude of 
420 km above the Earth’s surface have been analyzed to 
yield Gauss coefficients to order n = 66, special techniques 
being invoked for orders n> 29. A plot of the energy 
density associated with each order n of the geomagnetic 
field shows three distinct segments (Fig. 5.33). The high- 
frequency terms of order n > 40 are uncertain and the 
terms with n > 50 are in the “noise level” of the analysis 
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Box 5.3: Multipole representation of the geomagnetic field 


Each term in Eq. (5.34) is the potential of the magnetic 
field due to a particular combination of magnetic poles. 
For example, Eq. (5.34) contains three terms for which n 
= 1. Each corresponds to a dipole field. One dipole is 
parallel to the axis of rotation of the Earth and the 
other two are at right angles to each other in the equato¬ 
rial plane. The five terms with n = 2 describe a geometri¬ 
cally more complex field, known as a quadrupole field. 
Just as a dipole field results when two single poles are 
brought infinitesimally close to each other (see Section 
5.2.3), a quadrupole field results when two opposing 
coaxial dipoles are brought infinitesimally close end-to- 
end. As its name implies, the quadrupole field derives 
from four magnetic poles of which two are “north” 
poles and two are “south” poles. The terms with n = 3 
describe an octupole field which is characterized by 
eight (2 3 ) poles. The terms with n = N describe a field 
that arises from a configuration of 2 N poles. 

The configurations of axial dipole, quadrupole and 
octupole fields relative to a reference sphere are illustrated 
in Fig. B5.3.1 for the case where the order m of the Gauss 
coefficients is zero. If m = 0 in Eq. (5.34), the potential 
does not vary around a circle of “latitude” defined by a 
chosen combination of 6 and r. This kind of field is said to 
have zonal symmetry. Any cross-section that contains the 
axis of symmetry is representative for the symmetry. Fig. 
B5.3.1 shows axial cross-sections of the simplest field line 
geometries corresponding to (a) dipole, (b) quadrupole 
and (c) octupole fields; each field is rotationally symmetri¬ 
cal about the axis of the configuration. The correspond¬ 
ing zonal spherical harmonics are illustrated symbolically 
by shading the alternate zones in which magnetic field 
lines leave or return to the surface of a sphere. 

The dipole field is horizontal at the equator. In the 
southern hemisphere the field lines leave the reference 
sphere; in the northern hemisphere they return to it. In 
the northern hemisphere the field of an axial quadrupole 
is horizontal at latitude 35.3°N. North of this latitude the 
field lines of the quadrupole leave the reference sphere. 
An equivalent circle of latitude is located in the southern 
hemisphere at 35.3°S; south of this latitude the quadru¬ 
pole field lines also leave the Earth. The field lines re¬ 
enter the Earth in the band of latitudes between those 
where the field is horizontal. The symmetry of the 
quadrupole field is described by these three zones around 
the axis. The axial octupole field exhibits zonal symmetry 
with four zones; two zones in which the field leaves the 
Earth alternate with two zones in which it re-enters it. 

Terms in the potential expansion for which the degree 
m of the Gauss coefficients is equal to the order n (e.g., g\, 
h\, gl, hV) are called sectorial harmonics. Their symmetry 
relative to the Earth’s axis is characterized by an even 
number of sectors around the equator in which the field 
lines alternately leave and re-enter the Earth (Fig. B5.3.2). 
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Fig.B5.3.1 Axial cross-sections showing the field line geometries of 
(a) dipole, (b) quadrupole and (c) octupole fields: each field is 
rotationally symmetrical about the axis of the configuration. The 
corresponding zonal spherical harmonics are illustrated symbolically 
by shading the alternate zones in which magnetic field lines leave or 
return to the surface of a sphere. 
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Fig.B5.3.2 Symmetry relative to the axis of rotation of sectorial and 
tesseral spherical harmonic functions. 


The potential terms for which m + n (e.g., g\, h\, gh\) are 
known as tesseral harmonics. Their pattern of symmetry 
is defined by the intersections of circles of latitude and 
longitude, which outline alternating domains in which 
the field lines leave and re-enter the Earth, respectively. 

Multipole representation allows the very complex 
geometry of the total field to be broken down into contri¬ 
butions from a number of fields with simple geometries. 

By superposing many terms corresponding to these 
simple geometries a field of great complexity can be 
generated. It must be kept in mind that this is not a 
physical expression of the magnetic field, because mag¬ 
netic poles do not exist. It is a convenient technique for 
describing the field mathematically. 
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Fig. 5.3B The energy density spectrum derived from measurements of 
the geomagnetic field made by the MAGSAT Earth-orbiting satellite 
(after Cain, 1989). 

and cannot be attributed any geophysical importance. The 
terms 15^n^40 are due to short-wavelength magnetic 
anomalies associated with the magnetization of the Earth’s 
crust. The terms n < 14 dominate the Earth’s magnetic field 
and are due to much deeper sources in the fluid core. 


5.4.4.1 The dipole field 

The most important part of the Earth’s magnetic field at 
the surface of the Earth is the dipole field, given by the 
Gauss coefficients for which n = 1. If we write only the 
first term of Eq. (5.34) we get the potential 


W= 


cos0 
r 2 


(5.35) 


Note that the spatial variation of this potential depends 
on (cos dir 2 ) in the same way as the potential of a dipole, 
which was found (Eq. (5.8)) to be 


W= 


p. 0 m cos 6 

4ttt 2 


(5.36) 


Comparison of the coefficients of (cos 6/r 2 ) in Eq. (5.35) 
and Eq. (5.36) gives the dipole moment of the Earth’s 
axial dipole in terms of the first Gauss coefficient: 

= (5.37) 


The term g^ is the strongest component of the field. It 
describes a magnetic dipole at the center of the Earth and 
aligned with the Earth’s rotation axis. This is called the 
geocentric axial magnetic dipole. 

The magnetic field B of a dipole is symmetrical about 
the axis of the dipole. At any point at distance r from the 
center of a dipole with moment m on a radius that makes 
an angle 6 to the dipole axis the field of the dipole has a 
radial component B r and a tangential component B g , 
which can be obtained by differentiating the potential 
with respect to r and 9, respectively: 


, _ dW = f l o 2/wcosfl 

r dr 4'tt r 3 

, _ 1 dW_ Ah) msinO 

'»~ r d9 ~ 4tt ,- 3 


(5.38) 

(5.39) 


Note that B r vanishes at the equator (9= 90°) and the 
field is horizontal; comparing Eqs. (5.37) and (5.39) we 
get that the horizontal equatorial field B g is equal to g°. At 
a point (r, 9) on the surface of a uniformly magnetized 
sphere the magnetic field line is inclined to the surface at 
an angle I, which is given by 

B r 

tan/=n- = 2cot0 = 2tanA (5.40) 


The angle / is called the inclination of the field, and 0 is 
the angular distance (or polar angle) of the point of 
observation from the magnetic axis. The polar angle is the 
complement of the magnetic latitude, A (i.e., 9= 90° - A). 
Equation (5.40) has an important application in paleo- 
magnetism, as will be seen later. 

The terms g} and h\ are the next strongest in the poten¬ 
tial expansion. They describe contributions to the potential 
from additional dipoles with their axes in the equatorial 
plane. The total dipole moment of the Earth is then 
obtained from the vector sum of all three components: 

m = ^ R ^(sT> 2 + (g\) 2 + (h\) 2 (5.41) 

The analysis of the geomagnetic field for the year 2005 
gave the following values for the dipole coefficients: g° = 
-29,556.8 nT; gj = -1671.8 nT; h\ = 5080.0 nT. The 
strength of the Earth’s dipole magnetic moment obtained 
by inserting these values in Eq. (5.41) is m — 7.7674 X 10 22 
A m 2 . Note that the sign of g° is negative. This means that 
the axial dipole points opposite to the direction of rota¬ 
tion. Taken together, the three dipole components describe 
a geocentric dipole inclined at about 11.2° to the Earth’s 
rotation axis. This tilted geocentric dipole accounts for 
more than 90% of the geomagnetic field at the Earth’s 
surface. Its axis cuts the surface at the north and south geo¬ 
magnetic poles. For epoch 2005 the respective poles were 
located at 79.7°N, 71.8°W (i.e., 288.2°E) and 79.7°S, 
108.2°E. The geomagnetic poles are antipodal (i.e., exactly 
opposite) to each other. 

The magnetic poles of the Earth are defined as the loca¬ 
tions where the inclination of the magnetic field is ± 90° 
(i.e., where the field is vertically upward or downward). An 
isoclinal map (showing constant inclination values) for the 
year 1980 shows that the location of the north magnetic 
pole was at 77.3°N, 258.2°E while the south magnetic pole 
was at 65.6°S, 139.4°E (Fig. 5.34a). These poles are not 
exactly opposite one another. The discrepancy between 
the magnetic poles and the geomagnetic poles arises 
because the terrestrial magnetic field is somewhat more 
complex than that of a perfect dipole. The intensity of the 
geomagnetic field is generally stronger in high latitudes 
than near the equator (Fig. 5.34b). The intensity is 
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Fig. 5.34 (a) The isoclinal 
map of the geomagnetic field 
for the year 1980 AD (after 
Merrill and McElhinny, 1983), 
and (b) the total intensity of 
the International 
Geomagnetic Reference Field 
(in p.T) for 2000 (source: 
http ://geomag. nasa. gov). 


(a) Inclination 



(b) Total intensity 



especially low over the South Atlantic, where it is about 20 
(jlT weaker than expected. The cause of this “South 
Atlantic magnetic anomaly” is not understood. Because 
the geomagnetic field shields the Earth from cosmic radia¬ 
tion and charged particles of the solar wind, this protec¬ 
tion is less effective over the South Atlantic. Orbiting 
satellites note increased impacts of extra-terrestrial parti¬ 
cles over this region. The feature poses hazards for astro¬ 
nauts in low-orbiting spacecraft and also for pilots and 
passengers in high-altitude aircraft that pass through the 
region. The enhanced particle flux can interfere with their 
on-board computers, communications and guidance 
systems. 

5.4.4.2 The non-dipole field 

The part of the field of internal origin (about 5% of the 
total field), obtained by subtracting the field of the 
inclined geocentric dipole from the total field, is collec¬ 
tively called the non-dipole field. A map of the non-dipole 
field consists of a system of irregularly sized, long- 
wavelength magnetic anomalies (Fig. 5.35). To describe 
this field requires all the terms in the potential expansion 
of order «>2in Eq. (5.34). 


The distribution of positive and negative non-dipole 
field anomalies suggests an alternative representation of 
the non-dipole field to the multipole portrayal. The posi¬ 
tive and negative anomalies have been modelled by 
inward or outward oriented radial dipoles in the core at 
about one-quarter the Earth’s radius. Each dipole is pre¬ 
sumed to be caused by a toroidal current loop parallel to 
the surface of the core. A single centered axial dipole and 
eight auxiliary radial dipoles are adequate to represent 
the field observed at the Earth’s surface. The secular vari¬ 
ation at a site (Section 5.4.5.2) is explained by this model 
as the passage of one of the auxiliary dipoles under the 
site. This model may be a little closer to physical reality 
but it is more unwieldy to handle. The multipole represen¬ 
tation is the most convenient way of modelling the geo¬ 
magnetic potential for mathematical analysis. 

5.4.5 Secular variation 

At any particular place on the Earth the geomagnetic field 
is not constant in time. When the Gauss coefficients of the 
internal field are compared from one epoch to another, 
slow but significant changes in their values are observed. 
The slow changes of the field only become appreciable 
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Fig. 5.35 The vertical component of the non-dipole magnetic field for 
the years 1780 AD (after Yukutake and Tachinaka, 1968) and 1980 AD 
(after Barton, 1989). 


over decades or centuries of observation and so they are 
called secular variations (from the Latin word saeculum for 
a long age). They are manifest as variations of both the 
dipole and non-dipole components of the field. 


5.4.5.1 Secular variation of the dipole field 

The dipole field exhibits secular variations of intensity 
and direction. Calculations of the Gauss coefficients for 
different historical epochs show a near-linear decay of 
the strength of the dipole moment at a rate of about 3.2% 
per century between about 1550 AD and 1900 AD. At 
the start of the twentieth century the decay became even 
faster and has averaged about 5.8% per century during 
the last 80 yr (Fig. 5.36a). If it continues at the same 
almost linear rate, the field intensity would reach zero in 
about another 2000 yr. The cause of the quite rapid decay 
in intensity is not known; it may simply be part of a 
longer term fluctuation. However, another possibility is 
that the dipole moment may be decreasing preparatory 
to the next reversal of geomagnetic field polarity. 

The position of the dipole axis also shows secular 
variation. The changes can be traced by plotting the 
colatitude (the angle between the dipole axis and the rota¬ 
tion axis) and longitude of the geomagnetic pole as a 
function of time. Data are only sufficiently abundant for 
spherical harmonic analysis since the early nineteenth 



Fig. 5.36 Secular variations of the tilted geomagnetic centered dipole 
from 1 550 AD to 1900 AD. (a) Decrease of dipole moment; (b) slow 
changes of the tilt of the dipole axis relative to the rotation axis, and (c) 
longitude variation indicating westward drift of the geomagnetic poles 
(after Barton, 1989). 


century. Less reliable data, enlarged by archeomagnetic 
results (see Section 5.6.2.1), allow estimates of the secular 
variation of the dipole axis since the middle of the six¬ 
teenth century. The earlier data suggest that in the six¬ 
teenth century the dipole axis was tilted at only about 3° 
to the rotation axis; a gradual increase in tilt took place 
between the sixteenth and nineteenth centuries. During 
the last 200 yr the dipole axis has maintained an almost 
constant tilt of about 11-12° to the rotation axis (Fig. 
5.36b). 

For the past 400 yr the longitude of the geomagnetic 
pole has drifted steadily westward (Fig. 5.36c). Before the 
nineteenth century the pole moved westward at about 
0.14° yr -1 ; this corresponds to a pseudo-period of 2600 yr 
for a complete circle about the geographic pole. However, 
since the early nineteenth century the westward motion 
of the pole has been slower, at an average rate of 
0.044°yr _1 , which corresponds to a pseudo-period of 
8200 yr. 
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Fig. 5.37 The variation with latitude of average westward drift rates 
estimated from inclination and declination observations at geomagnetic 
observatories in the northern hemisphere. The curve gives the angular 
rotation rate at the surface of the core for a linear velocity of 
0.058cm s -1 (after Yukutake, 1967). 
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5.45.2 Secular variation of the non-dipole field 

Comparison of maps of the non-dipole field for different 
epochs (Fig. 5.35) show two types of secular variation. 
Some anomalies (e.g., over Mongolia, the South Atlantic 
and North America) appear to be stationary but they 
change in intensity. Other anomalies (e.g., over Africa) 
slowly change position with time. The secular variation of 
the non-dipole field therefore consists of a standing part 
and a drifting part. 

Although some foci may have a north-south compo¬ 
nent of motion, the most striking feature of the secular 
variation of the recent non-dipole field is a slow westward 
drift. This is superposed on the westward drift of the 
dipole, but can be separated readily by spherical har¬ 
monic analysis. The rate of drift of the non-dipole field 
can be estimated from longitudinal changes in selected 
features plotted for different epochs. The mean rate of 
westward drift of the non-dipole field in the first half of 
the last century has been estimated to be 0.18° yr -1 , corre¬ 
sponding to a period of about 2000 yr. Flowever, some 
foci drift at up to about 0.7° yr -1 , much faster than the 
average rate. Results from several geomagnetic observato¬ 
ries show that the rate of drift is dependent on latitude 
(Fig. 5.37). 

Westward drift is an important factor in theories of 
the origin of the geomagnetic field. It is considered to be a 
manifestation of rotation of the outer layers of the core 
relative to the lower mantle. Theoretical models of the 
geomagnetic field (discussed in the next section) presume 
conservation of angular momentum of the fluid core. To 
maintain the angular momentum of a particle of fluid 
that moves radially inwards (decreasing the distance from 


Angular velocity of core 
relative to mantle (° yr -1 ) 

Fig. 5.38 Interpreted velocity distribution (relative to the mantle) for a 
multi-layered core model in which the change of angular momentum of 
each layer due to convectional fluid motion is balanced by 
electromagnetic forces (after Watanabe and Yukutake, 1975). 

the rotational axis) its angular rate of rotation must speed 
up. This results in a layered structure for the radial profile 
of angular rate of rotation relative to the mantle (Fig. 
5.38). The outer layers of the core probably rotate more 
slowly than the solid mantle, imparting a westward drift 
to features of the magnetic field rooted in the fluid 
motion. 

5.4.6 Origin of the internal field 

Analysis of the Gauss coefficients and the wavelengths of 
features of the non-dipole field indicate that the main 
field is produced in the fluid outer core of the Earth. The 
composition of the fluid core has been estimated from 
seismic and geochemical data. The major constituent is 
liquid iron, with smaller amounts of other less dense ele¬ 
ments. Geochemical analyses of iron meteorites suggest 
that the core composition may have a few percent of 
nickel, while shock-wave experiments require 6-10% of 
non-metallic light elements such as silica, sulfur or 
oxygen. The solid inner core is inferred from seismic and 
shock-wave data to consist of almost pure iron. 

For the generation of the magnetic field the important 
parameters of the core are its temperature, viscosity and 
electrical conductivity. Temperature is known very 
poorly inside the Earth, but probably exceeds 3000 °C in 
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the liquid core. The electrical conductivity of iron at 
20 °C is 10 7 II -1 m _1 and decreases with increasing tem¬ 
perature. At the high temperatures and pressures in the 
core the electrical conductivity is estimated to be around 
(3-5) X 10 5 fl _1 m _1 , which corresponds to a good 
electrical conductor. For comparison, the conductivity 
of carbon (used for many electrical contacts) is 
3 X 10 4 at 20 °C. 

5.4.6.1 Magnetostatic and electromagnetic models 

As observed by Gilbert in 1600, the dipole field of the 
Earth resembles that of a uniformly magnetized sphere. 
However, permanent magnetization is an inadequate 
explanation for the geomagnetic dipole. The mean mag¬ 
netization of the Earth would need to be many times 
greater than that of the most strongly magnetic crustal 
rocks. The Curie point of the most important minerals at 
atmospheric pressure is less than 700 °C, which is reached 
at depths of about 25 km so that only a thin outer crustal 
shell could be permanently magnetized. The necessary 
magnetization of this shell is even greater than values 
observed in crustal rocks. Moreover, a magnetostatic 
origin cannot account for the temporal changes observed 
in the internal field, such as its secular variation. 

The main magnetic field of the Earth is thought to be 
produced by electrical currents in the conductive core. 
Although the core is a good conductor, an electrical 
current system in the core continually loses energy 
through ohmic dissipation. The lost electrical energy is 
converted to heat and contributes to the thermal balance 
of the core. The equations of electromagnetism applied to 
the core show that an electrical current in the core would 
decay to zero in around 10,000-20,000 yr unless it is sus¬ 
tained. Paleomagnetic evidence in the form of coherently 
magnetized rocks supports the existence of a geomag¬ 
netic field since Precambrian time, i.e. for about 3 000 
Myr. This implies that it must be continuously main¬ 
tained or regenerated. The driving action for the main 
field is called the dynamo process, by analogy to the pro¬ 
duction of electrical power in a conductor that rotates in 
a magnetic field. 

5.4.62 The geomagnetic dynamo 

When a charged particle moves through a magnetic field, 
it experiences a deflecting electrical field (called the 
Lorentz field) proportional to the magnetic flux density B 
and the particle velocity v, and acting in the direction 
normal to both B and v. The Lorentz field acts as an addi¬ 
tional source of electrical current in the core. Its strength 
is dependent on the velocity of motion of the conducting 
fluid relative to the magnetic field lines. When this term is 
included in the Maxwell electromagnetic equations, a 
magnetohydrodynamic equation relating the magnetic 
field B to the fluid flow v and conductivity cr in the core is 
obtained. It is written 


W = /^ v2b + VMvXB ) (5 - 42) 

This vector equation, although complicated, has imme¬ 
diate consequences. The left side gives the rate of change 
of magnetic flux in the core; it is determined by two terms 
on the right side. The first is inversely dependent on the 
electrical conductivity, and determines the decay of the 
field in the absence of a driving potential; the better the 
conductor, the smaller is this diffusion term. The second, 
dynamo term, depends on the Lorentz electrical field, 
which is determined by the velocity field of the fluid 
motions in the core. The conductivity of the outer core 
((3-5) X 10 5 II -1 m -1 ) is high and for a fluid velocity of 
about lnims -1 the dynamo term greatly exceeds the 
diffusion term. Under these conditions, the lines of mag¬ 
netic flux in the core are dragged along by the fluid flow. 
This concept is called the frozen-flux theorem, and it is fun¬ 
damental to dynamo theory. The diffusion term is only 
zero if the electrical conductivity is infinite. There is proba¬ 
bly some diffusion of the field through the fluid, because it 
is not a perfect conductor. However, the frozen-flux 
theorem appears to approximate well the conditions in the 
fluid outer core. 

The derivation of a solution of the dynamo theory is 
difficult. In addition to Maxwell's equations with the addi¬ 
tion of a term for the Lorentz field, the Navier-Stokes 
equation for the fluid flow, Poisson’s equation for the grav¬ 
itational potential and the generalized equation of heat 
transfer must be simultaneously satisfied. The fluid flow 
consists of a radial component and a rotational compo¬ 
nent. The energy for the radial flow comes from two 
sources. The slow cooling of the Earth produces a temper¬ 
ature gradient in the core, which results in thermally 
driven convection in the iron-rich fluid of the outer core. 
This is augmented by latent heat released at the boundary 
between the inner and outer cores as the inner core solidi¬ 
fies. The solidification of the pure iron inner core depletes 
the fluid of the outer core of its heaviest component. The 
remaining lighter elements rise through the liquid outer 
core, causing a buoyancy-driven convection. 

The rotational component of the fluid flow is the result 
of a radial velocity gradient in the liquid core, with inner 
layers rotating faster than outer layers (Fig. 5.38). The rel¬ 
ative rotation of the conducting fluid drags magnetic field 
lines around the rotational axis to form ring-like, toroidal 
configurations. The toroidal field lines are parallel to the 
flow and therefore to the surface of the core. This means 
that the toroidal fields are confined to the core and cannot 
be measured; their strengths and configurations must be 
estimated from models. Their interactions with the 
upwelling and descending branches of convective cur¬ 
rents create electrical current systems that produce 
poloidal magnetic fields. These, in their turn, escape from 
the core and can be measured at the surface of the Earth. 
The fluid motions are subject to the effects of Coriolis 
forces, which prove to be strong enough to dominate the 
resultant flow patterns. 
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5.4.63 Computer simulation of the geodynamo 

Important advances in understanding the geomagnetic 
dynamo (or geodynamo) have been made by simulating 
the related core processes with supercomputers. In 1995, 
G. A. Glatzmaier and P. H. Roberts presented a numeri¬ 
cal model for the generation of a magnetic field assuming 
a hot convective fluid outer core surrounding a solid inner 
core, with rotation rate akin to that of the Earth. The heat 
flow, electrical conductivity and other material properties 
were made as similar as possible to those of the Earth’s 
core. The simulated dynamo had dominantly dipole char¬ 
acter and intensity similar to the Earth’s, and it exhibited 
a westward drift of the non-dipole field comparable to 
that measured at the Earth's surface. It also reversed 
polarity spontaneously, with long periods of constant 
polarity between short polarity transitions, as is the case 
for the history of geomagnetic reversals in the last 
160 Myr (Section 5.7). During a polarity reversal, the field 
intensity decreased by an order of magnitude, as is also 
observed in paleomagnetic studies (see Fig. 5.71). 

The simulations showed that the ability to reverse 
polarity was increased when the heat flow across the 
core-mantle boundary was non-uniform, as in the Earth, 
showing that thermal conditions in the lower mantle 
influence the formation of the magnetic field in the fluid 
core. The solid inner core evidently plays an important 
role in the reversal process. Magnetic fields in the outer 
core can change quickly, accompanying convection, and 
may act to initiate reversals. However, magnetic fields in 
the solid inner core change more slowly by diffusion and 
thus the inner core may act to stabilize the field against 
reversals. A reversal occurs occasionally, on a longer time 
scale than the core processes. 

A prediction of this model is that the magnetic field 
couples the inner core to the eastward flowing fluid above 
it, so that the inner core rotates slightly faster than the 
mantle and crust (Fig. 5.38). There is seismic evidence 
that this indeed occurs. The solid inner core is seismically 
anisotropic, with cylindrical symmetry about an axis 
close to the rotation axis. The axis of symmetry rotates at 
a rate about 1° faster than that of the mantle and crust. 

5.4.7 Magnetic fields of the Sun, Moon and planets 

Our knowledge of the magnetic fields of the Sun and 
planets derives from two types of observation. Indirect 
observations utilize spectroscopic effects. All atoms emit 
energy related to the orbital and spin motions. The energy 
is quantized so that the atom possesses a characteristic 
spectrum of energy levels. The lowest of these is called the 
ground state. When an atom happens to have been excited 
to a higher energy level, it is unstable and eventually 
returns to the ground state. In the process the energy cor¬ 
responding to the difference between the elevated and 
ground states is emitted as light. For example atomic 
hydrogen gas in the Sun and galaxies emits radiation at 


1420 MHz, with corresponding wavelength of 21cm. 
This frequency is in the microwave range and can be 
detected by radio telescopes. If the hydrogen gas is in 
motion the frequency is shifted by the Doppler effect, 
from which the velocity of the motion can be deduced. 

In the presence of a magnetic field a spectral line can 
split into several lines. This “hyperfine splitting” is called 
the Zeeman effect. When a hydrogen atom is in the 
ground state its hyperfine structure has only one line. 
However, if the atom is in a magnetic field, it acquires 
additional energy from the interactions between the atom 
and the magnetic field and hence it can exist in several 
energy states. As a result the spectral lines of energy 
emitted by the atom are split into closely spaced lines, rep¬ 
resenting transitions between the different energy states. 
The energy differences between these states are dependent 
on the strength of the magnetic field, which can be esti¬ 
mated from the observations. 

Direct observations of extra-terrestrial magnetic fields 
have been carried out since the 1960s by space probes. 
Magnetometers mounted in these spacecraft have 
recorded directly the intensity of the interplanetary mag¬ 
netic field as well as the magnetic fields around several 
planets. The manned Apollo missions to the Moon 
resulted in a large amount of data obtained from the 
orbiting spacecraft. The materials collected on the 
Moon’s surface and brought back to Earth by the astro¬ 
nauts have provided valuable information about lunar 
magnetic properties. 

5.4.7.1 Magnetic field of the Sun 

The Sun has nearly 99.9% of the mass of the solar system. 
About 99% of this mass is concentrated in a massive 
central core that reaches out to about 80% of the Sun’s 
radius. The remainder of the Sun consists of an outer 
conducting shell, which contains only 1% of the Sun’s 
mass but has a thickness equivalent to about 20% of the 
solar radius. Thermonuclear conversion of hydrogen into 
helium in the dense core produces temperatures of the 
order of 15,000,000 K. The visible solar disk is called the 
photosphere. Its diameter is about 2,240,000 km (about 
175 times the diameter of the Earth) and its surface tem¬ 
perature is about 6000 K. The lower solar atmosphere is 
called the chromosphere ; the outer atmosphere is called 
the corona. The chromosphere includes spike-like emis¬ 
sions of hydrogen gas called solar prominences that 
sometimes can reach far out into the corona. 

The heat from the core is radiated out to the outer con¬ 
ducting shell, where it sets up convection currents. Some 
of these convection currents are small scale (about 
1000 km across) and last only a few minutes; others are 
larger scale (30,000 km across) and persist for about a 
(terrestrial) day. The convection is affected by the Sun’s 
rotation and is turbulent. 

The rotation of the Sun has been estimated spectro¬ 
scopically by measuring the Doppler shift of spectral 



5.4 GEOMAGNETISM 


317 


lines. Independent estimates come from observing the 
motion of solar features like sunspots, etc. The rotational 
axis is tilted slightly at 7° to the pole to the ecliptic. Near 
the Sun’s equator the rotational period is about 25 Earth 
days; the polar regions rotate more slowly with a period 
of 35-40 days. The Sun’s core may rotate more rapidly 
than the outer regions. Turbulent convection and velocity 
shear in the outer conducting shell are conducive to a 
dynamo origin for the Sun’s magnetic field. The surface 
field is dipolar in higher latitudes but has a more compli¬ 
cated pattern in equatorial latitudes. 

Temperatures in the outer solar atmosphere (corona) 
are very high, around 1,000,000 K. The constituent parti¬ 
cles achieve velocities in excess of the escape velocity of 
the Sun’s gravitational field. The supersonic stream of 
mainly protons (H + ions), electrons and a-particles 
(He 2+ ions) escaping from the Sun forms the solar wind. 
The flow of electric charge produces an interplanetary 
magnetic field (IMF) of varying intensity. At the distance 
of the Earth from the Sun the IMF measures about 6 nT. 

Sunspots have been observed since the invention of the 
telescope (ca. 1610 AD). A sunspot is a dark fleck on the 
surface of the Sun, measuring roughly 1000 to 100,000 km 
in diameter. It has a lower temperature than the surround¬ 
ing photosphere and represents a strong disturbance 
extending far into the Sun’s interior. Sunspots last for 
several days or weeks and move with the Sun’s rotation, 
providing a means of estimating the rotational speed. The 
frequency of sunspots changes cyclically with a period of 
11 years. 

Intense magnetic fields are associated with the 
sunspots. These often occur in unequally sized pairs of 
opposite polarity. The predominating magnetic polarity 
of sunspots changes from one period of maximum 
sunspot activity to the next, implying that the period is in 
fact 22 years. The magnetic field of each sunspot is 
toroidal. It can be imagined to resemble a vortex, or 
tornado, in which the magnetic field lines leave the solar 
surface in one sunspot and return to it in the other 
member of the pair. The polarity of the Sun’s dipole field 
also reverses with the change of polarity of the sunspots, 
indicating that the features are related. 

Associated with the sunspots and their strong mag¬ 
netic fields are emissions of hydrogen gas, called solar 
flares. The charged particles ejected in the flares con¬ 
tribute to the solar wind, which consequently transfers 
the sunspot cyclicity to fluctuations in the Earth’s mag¬ 
netic field of external origin and to ancillary terrestrial 
phenomena such as magnetic storms, brilliant auroras 
and interference with radio transmissions. 

54.7.2 Lunar magnetism 

Classical hypotheses for the origin of the Moon - rota¬ 
tional fission, capture or binary accretion - have been 
superseded by the Giant Impact hypothesis. According to 
this model, the Earth experienced a catastrophic collision 


with a Mars-sized protoplanet early in its development. 
Some debris from the collision remained in orbit around 
the Earth, where it re-accumulated as the Moon about 
4.5 Ga ago. This origin accounts for why the Earth and 
Moon have the same oxygen isotope composition. At the 
time of the Giant Impact the Earth's mantle and core had 
already differentiated and dense iron had settled into the 
core. This may also have taken place in the impactor so 
the Moon formed from the iron-depleted rocky mantles 
of the impacting bodies. The Moon has only a small core, 
whereas if it had originated like the other planets its core 
would be proportionately larger. 

The early Moon was probably covered by an ocean of 
molten magma at least a hundred kilometers thick, the 
relicts of which are now the major constituents of the 
lunar highlands. After cooling and solidification, the lunar 
surface was bombarded by planetesimals and meteoroids 
until about 4 Ga ago. The huge craters left by the impacts 
were later filled by molten basalt to form the lunar maria. 
Rock samples recovered from the maria in the manned 
Apollo missions have been dated radiometrically at 
3.1-3.9 Ga. The volcanism may have continued after this 
time but probably ceased about 2.5 Ga ago. Since then the 
lunar surface has been pulverized by the constant bom¬ 
bardment by meteorites, micrometeorites and elementary 
particles of the solar wind and cosmic radiation. Except 
for the lunar highlands the surface is now covered by a 
layer of shocked debris several meters thick called the 
lunar regolith. 

Our knowledge of lunar magnetism derives from mag¬ 
netic field measurements made from orbiting spacecraft, 
magnetometers set up on the Moon and rock samples col¬ 
lected from the lunar surface and brought back to Earth. 
Measurements of the lunar surface field from orbiting 
spacecraft have been made by on-board magnetometers 
and by the electron reflection method, the principle of 
which is basically the same as used to explain the origin of 
the Van Allen belts in the Earth’s magnetosphere (see 
Section 5.4.3.1). Electrons rain abundantly upon the 
lunar surface, in part from the solar wind and in part from 
the charges trapped in the Earth’s magnetotail, which is 
crossed by the monthly orbit of the Moon about the 
Earth. In the absence of a lunar magnetic field the elec¬ 
trons would be absorbed by the lunar surface. As in the 
Earth’s magnetosphere, an electron incident on the Moon 
is forced by the Lorentz force to spiral about a magnetic 
field line (see Fig. 5.30). As the electron approaches the 
lunar surface the magnetic field strength increases and the 
pitch of the helix decreases to zero. At the point of closest 
approach (also called the mirroring point) the electron 
motion is completely rotational about the field line. The 
electron path then spirals with increasing pitch back 
along the field line, away from the Moon. The effective¬ 
ness of the electron reflection is directional; for a given 
electron velocity there is a critical angle of incidence 
below which reflection ceases. By counting the number of 
electrons with a known energy that are reflected past a 
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Table 5.2 Magnetic characteristics of the planets (data source: Van Allen and Bagenal, 1999) 


Planet 

Mean 

orbital 

radius 

[AU] 

Mean 
radius 
of planet 
[km] 

Period 

of 

rotation 

[days] 

Magnetic 

dipole 

moment 

['%:] 

Equivalent 
equatorial 
magnetic 
field [nT] 

Dipole 
tilt to 
rotation 
axis [°] 

Mercury 

0.3830 

2,440 

58.81 

0.0007 

300 

14 

Venus 

0.7234 

6,052 

243.7(R) 

< 0.0004 

<3 

— 

Earth 

1 

6,371 

1 

1 

30,500 

10.8 

Moon 

0.00257 

1,738 

27.32 

— 

— 

— 

Mars 

1.520 

3,390 

1.0275 

< 0.0002 

<30 

— 

Jupiter 

5.202 

69,910 

0.414 

20,000 

428,000 

9.6 

Saturn 

9.576 

58,230 

0.444 

600 

22,000 

< 1 

Uranus 

19.19 

25,362 

0.720(R) 

50 

23,000 

58.6 

Neptune 

30.05 

24,625 

0.671 

25 

14,000 

47 


satellite in known directions, the surface field B s can be 
estimated from the field B 0 measured at the altitude of the 
satellite. 

The lunar magnetic field was surveyed extensively in 
1998 by the Lunar Prospector orbiting spacecraft. The 
measurements show that the Moon currently has no 
detectable dipole moment. It has only a very weak non- 
dipolar magnetic field due to local magnetization of 
crustal rocks. Estimates of surface anomalies suggest that 
the largest are of the order of 100 nT. From records of 
meteoritic impacts and moonquakes made by seismo¬ 
meters left on the Moon for several years it has been 
inferred that, if the Moon has a core (necessary for a 
lunar dynamo), it must be smaller than 400-500 km in 
radius, representing less than 2% of the Moon’s volume 
and 1-3% of its mass. In contrast, the Earth's core occu¬ 
pies about 16% of the planet’s volume and has about 33% 
of its mass. Lunar Prospector detected magnetic field 
changes due to electrical currents induced in the Moon as 
it traversed the stretched out tail of Earth's magnetos¬ 
phere. They suggest an even smaller, iron-rich metallic 
core, with radius 340 ± 90 km (Hood et al., 1999). 

Although the Moon now has no global dipole mag¬ 
netic field, samples of lunar rocks recovered in the 
manned Apollo missions possessed quite strong natural 
remanent magnetizations. Rock magnetic studies on 
Apollo samples with radiometric ages of ~3.6-3.9Ga 
suggest that they were magnetized in fields of the order of 
10-100 p,T (0.1-1 gauss), much stronger than present 
fields on the Moon. As yet, the ancient lunar magnetic 
field is not well understood. The interior of the Moon 
may have acquired a primordial remanent magnetization 
in the external field of the Sun or Earth. In turn this may 
have provided the fields for acquisition of the observed 
crustal magnetizations. The small lunar metallic core is 
currently solid, and the core heat flux is too low to have 
powered an internal dynamo at the time of formation of 
the Apollo samples. However, models suggest that a thin 
layer of the Moon’s mantle adjacent to the core may 
have first blanketed the core, then later provided enough 
radioactive heating to assist dynamo activity that 


persisted for a limited period during a “magnetic era” 
about 0.5-1.0 Ga after the Moon was formed (Stegman et 
al., 2003). 

5.4.73 Extra-terrestrial magnetic exploration 

Jupiter and Saturn, like the Earth, have magnetic fields 
that are strong enough to trap charged particles. The 
motions of these charged particles generate electromag¬ 
netic radiation which is detectable as radio waves far from 
the planet. The magnetic field of Jupiter was first detected 
in this way. Moreover, the radio emissions are modulated 
by the rotation of the planet. Analysis of the periodicity 
of their modulated radio emissions provides the best esti¬ 
mates of the rotational rates of Jupiter and Saturn. 

Most data concerning the magnetic fields of the 
planets (Table 5.2) have been obtained with flux-gate 
magnetometers (see Section 5.5.2.1) installed in passing 
or orbiting spacecraft. When the spacecraft traverses the 
magnetosphere of a planet, the magnetometer registers 
the passage through the bow shock and magnetopause 
(Fig. 5.39). A bow shock results from the supersonic colli¬ 
sion of the solar wind with the atmosphere of a planet, 
just as it does for the Earth (see Fig. 5.28). Counters of 
energetic particles register a sudden increase in frequency 
and the magnetometer shows a change in magnetic field 
strength during passage through the bow shock into the 
magnetosheath. When the spacecraft leaves the magne¬ 
tosheath and crosses the magnetopause it enters the 
region that is shielded from the solar wind by the mag¬ 
netic field of the planet. The magnetopause is where the 
kinetic energy of the plasma is equal to the potential 
energy of the planetary magnetic field. The existence of a 
bow shock may be regarded as evidence for a planetary 
atmosphere, while the magnetopause is evidence that the 
planet has a magnetic field. 

5.4.74 The magnetic fields of the planets 

Mercury was visited by the Mariner 10 spacecraft, which 
made three passes of the planet in 1974 and 1975. The 
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Fig. 5.39 Changes with time 
of particle density and 
magnetic field along the path 
of the Mariner 10 spacecraft 
during its flight past the 
planet Mercury (data from 
Ogilvie etal., 1977). 


Mariner 10 flyby of Mercury 



on-board magnetometer detected a bow shock and a 
magnetopause (Fig. 5.39), which imply that the planet 
has a magnetic field. On the first encounter a magnetic 
field of about lOOnT was measured at an altitude of 
700 km (radial distance of 3100 km), which was the point 
of closest approach. In estimating the magnetic field of 
the planet compensation must be made for the magnetic 
field of the solar wind. Different models have given esti¬ 
mates of the strength of the dipole moment in the range 
(2-7) X 10 -4 of the Earth's magnetic moment (w K ~ 
7.7674X 10 22 Am 2 in 2005), which would give a surface 
equatorial magnetic field of about 100-400 nT. The 
source of the magnetic field is uncertain. It is possible, but 
unlikely, that Mercury has a global magnetic moment due 
to crustal remanent magnetization; it would be difficult to 
explain the uniformity and duration of the external field 
needed to produce this. Mercury is believed to have a 
large iron core with a radius of about 1800 km, propor¬ 
tionately larger than Earth’s. Probably part of this core is 
molten, so it is possible that Mercury has a small active 
internal dynamo. This would however be unlike Earth’s. It 
might be of thermoelectric origin, or it could be due to 
dynamo processes in a thin shell-like liquid outer core 
surrounding a solid inner core. 

Venus was investigated by several American and 
Russian spacecraft in the 1960s. The instruments on 
Mariner 5 in 1967 clearly detected a bow shock from the 
collision of the solar wind with the planetary atmosphere. 
Magnetometer data from later spacecraft found no evi¬ 
dence for a planetary magnetic field. If a magnetopause 
exists, it must lie very close to the planet and may wrap 
around it. Data from the Pioneer Venus orbiter in 1978 set 
an upper limit of 10 ~ 5 m E for a planetary dipole moment, 
which would give a surface equatorial field of less than 1 
nT. The absence of a detectable magnetic field was not 
expected. As shown by Eq. (5.37) the dipole magnetic 
moment is proportional to the equatorial field times the 


cube of the planetary radius. The strength of a dynamo 
field is expected to be proportional to the core radius and 
to the rotation rate. These considerations give a scaling 
law whereby the dipole magnetic moment of a planet is 
proportional to its rotation rate and the fourth power of 
the core radius. The rotation of the planet is very slow 
compared to that of the Earth; one sidereal day on Venus 
lasts 243 Earth days, but the size of the planet is close to 
that of the Earth. It was therefore expected that Venus 
might have an internal dynamo with a dipole moment 
about 0.2% that of the Earth and an equatorial surface 
field of about 86 nT. It seems likely that the slow rotation 
does not provide enough energy for an active dynamo. 

Mars was expected to have a magnetic moment 
between that of Earth and Mercury, because of its size 
and rotation rate. Mariner 4 in 1965 was the first 
American spacecraft carrying magnetometers to visit 
Mars; it detected a bow shock but no conclusive evidence 
for a magnetopause. In September 1997, the Mars Global 
Surveyor spacecraft entered orbit around the planet. 
Since 1999 it has mapped the Martian magnetic field from 
an almost circular orbit about 400 ±30 km above the 
planet’s surface. The survey measurements have nearly 
uniform global coverage and show that there is no signifi¬ 
cant global magnetic field at present. This does not 
exclude the possible existence of a dynamo-generated 
global field in the planet’s distant past. The measured 
magnetic field consists of large regional magnetic anom¬ 
alies, which are attributed to remanent magnetization of 
the Martian crust. At the survey altitude the crustal mag¬ 
netic anomalies have amplitudes up to 220 nT, an order of 
magnitude larger than the crustal anomalies of Earth at 
that altitude, which attests to the presence of strongly 
magnetic minerals in the Martian crust. The crustal mag¬ 
netic field shows some features with circular geometry, 
some of which may be related to impact processes. 
Flowever, the most striking anomalies are prominent 
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east-west trending linear features over a region in the 
planet's southern hemisphere. These sub-parallel lin- 
eations are attributed to alternating bands of strongly 
magnetized crust. It is tempting to interpret their origin in 
the same way as the generation of oceanic magnetic lin- 
eations on Earth by a plate tectonic process in the pres¬ 
ence of a reversing field. However, as yet the origin of the 
magnetic anomalies on Mars is not understood. 

Jupiter has been known since 1955 to possess a strong 
magnetic field, because of the polarized radio emissions 
associated with it. The spacecraft Pioneer 10 and 11 in 
1973-4 and Voyager 1 and 2 in 1979 established that the 
planet has a bow shock and magnetopause. From 1995 
to 2003 the spacecraft Galileo made extensive surveys 
of Jupiter’s magnetosphere. The huge magnetosphere 
encounters the solar wind about 5,000,000 km “upwind” 
from the planet; its magnetotail may extend all the way to 
Saturn. Two reasons account for the great size of the mag¬ 
netosphere compared to that of Earth. First, the solar 
wind pressure on the Jovian atmosphere is weaker due to 
the greater distance from the Sun; secondly, Jupiter’s 
magnetic field is much stronger than that of the Earth. 
The dipole moment is almost 20,000w E which gives a 
powerful equatorial magnetic field of more than 
400,000 nT at Jupiter’s surface. The quadrupole and octu- 
pole parts of the non-dipole magnetic field have been 
found to be proportionately much larger relative to the 
dipole field than on Earth. The dipole axis is tilted at 9.7° 
to the rotation axis, and is also displaced from it by 10% 
of Jupiter's equatorial radius. The magnetic field of 
Jupiter results from an active dynamo in the metallic 
hydrogen core of the planet. The core is probably very 
large, with a radius up to 75% of the planet's radius. This 
would explain the high harmonic content of the magnetic 
field near the planet. 

Saturn was reached by Pioneer 11 in 1979 and the 
Voyager 1 and 2 spacecraft in 1980 and 1981, respectively. 
The on-board magnetometers detected a bow shock and 
a magnetopause. In 2004 the Cassini-Huygens spacecraft 
entered into orbit around Saturn. In 2005 the Huygens 
lander descended to the surface of Saturn’s largest moon 
Titan while Cassini continued to orbit and measure the 
parent planet's properties. Saturn’s dipole magnetic 
moment is smaller than expected, but is estimated to be 
around 500 /m e . This gives an equatorial field of 55,000 
nT, almost double that of Earth. The magnetic field has a 
purer dipole character (i.e., the non-dipole components 
are weaker) than the fields of Jupiter or Earth. The sim¬ 
plest explanation for this is that the field is generated by 
an active dynamo in a conducting core that is smaller rel¬ 
ative to the size of the planet. The axis of the dipole mag¬ 
netic field lies only about 1° away from the rotation axis, in 
contrast to 11.4° on Earth and 9.7° on Jupiter. 

Uranus is unusual in that its spin axis has an obliquity 
of 97.9°. This means that the rotation axis lies very close 
to the ecliptic plane, and the orbital planes of its satellites 
are almost orthogonal to the ecliptic plane. Uranus was 


visited by Voyager 2 in January 1986. The spacecraft 
encountered a bow shock and magnetopause and subse¬ 
quently entered the magnetosphere of the planet, which 
extends for 18 planetary radii (460,000 km) towards the 
Sun. Uranus has a dipole moment about 50 times 
stronger than Earth's, giving a surface field of 24,000 nT, 
comparable to that on Earth. Intriguingly, the axis of the 
dipole field has a large tilt of about 60° to the rotation 
axis; there is no explanation for this tilt. Another oddity is 
that the magnetic field is not centered on the center of the 
planet, but is displaced by 30% of the planet’s radius 
along the tilted rotation axis. The quadrupole component 
of the field is relatively large compared to the dipole. It is 
therefore supposed that the magnetic field of Uranus is 
generated at shallow depths within the planet. 

Neptune, visited by Voyager 2 in 1989, has a magnetic 
field with similar characteristics to that of Uranus. It is 
tilted at 49° to the rotation axis and is offset from the plan¬ 
etary center by 55% of Neptune’s radius. As in the case of 
Uranus, the magnetic field has a large quadrupole term 
compared to the dipole and thus probably originates in the 
outer layers of the planet, rather than in the deep interior. 

It is not yet known whether Pluto has a magnetic field. 
The planet is probably too small to have a magnetic field 
sustained by dynamo action. 

There is reasonable confidence that Mercury, Earth, 
Jupiter, Saturn, and Uranus have active planetary dynamos 
today. The magnetic data for Mars and Neptune are incon¬ 
clusive. All available data indicate that Venus and the 
Moon do not have active dynamos now, but possibly each 
might have had one earlier in its history. 


5.5 MAGNETIC SURVEYING 

5.5.1 The magnetization of the Earth's crust 

The high-order terms in the energy density spectrum of 
the geomagnetic field (Fig. 5.33) are related to the magne¬ 
tization of crustal rocks. Magnetic investigations can 
therefore yield important data about geological struc¬ 
tures. By analogy with gravity anomalies we define a mag¬ 
netic anomaly as the difference between the measured 
(and suitably corrected) magnetic field of the Earth and 
that which would be expected from the International 
Geomagnetic Reference Field (Section 5.4.4). The mag¬ 
netic anomaly results from the contrast in magnetization 
when rocks with different magnetic properties are adja¬ 
cent to each other, as, for example, when a strongly mag¬ 
netic basaltic dike intrudes a less magnetic host rock. The 
stray magnetic fields surrounding the dike disturb the 
geomagnetic field locally and can be measured with sensi¬ 
tive instruments called magnetometers. 

As discussed in Section 5.3.1, each grain of mineral in 
a rock can be classified as having diamagnetic, paramag¬ 
netic or ferromagnetic properties. When the rock is in a 
magnetic field, the alignment of magnetic moments 
by the field produces an induced magnetization (M ; ) 
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Fig. 5.40 The remanent (M r ), induced (M), and total (A/f t ) magnetizations 
in a rock, (a) For an arbitrary case M t lies between M- and M r , (b) for a very 
large Konigsberger ratio (Q n s>1) M t is close to M r , and (c) for a very small 
Konigsberger ratio (Q n <!1) M t is almost the same as M r 

proportional to the field, the proportionality constant 
being the magnetic susceptibility, which can have a wide 
range of values in rocks (see Fig. 5.13). The geomagnetic 
field is able to produce a correspondingly wide range of 
induced magnetizations in ordinary crustal rocks. The 
direction of the induced magnetization is parallel to the 
Earth’s magnetic field in the rock. 

Each rock usually contains a tiny quantity of ferro¬ 
magnetic minerals. As we have seen, these grains can 
become magnetized permanently during the formation of 
the rock or by a later mechanism. The remanent magneti¬ 
zation (M r ) of the rock is not related to the present-day 
geomagnetic field, but is related to the Earth’s magnetic 
field in the geological past. Its direction is usually 
different from that of the present-day field. As a result the 
directions of M r and IVT are generally not parallel. The 
direction of M ; is the same as that of the present field but 
the direction of M r is often not known unless it can be 
measured in rock samples. 

The total magnetization of a rock is the sum of the 
remanent and induced magnetizations. As these have 
different directions they must be combined as vectors (Fig. 
5.40a). The direction of the resultant magnetization of the 


rock is not parallel to the geomagnetic field. If the intensi¬ 
ties of M r and M ; are similar, it is difficult to interpret the 
total magnetization. Fortunately, in many important situ¬ 
ations M r and M ; are sufficiently different to permit some 
simplifying assumptions. The relative importance of the 
remanent and induced parts of the magnetization is 
expressed in the Konigsberger ratio (0 n ), defined as the 
ratio of the intensity of the remanent magnetization to 
that of the induced magnetization (i.e., Q n = M r /M ; ). 

Two situations are of particular interest. The first is 
when Q n is very large (i.e., Q n M). In this case (Fig. 5.40b), 
the total magnetization is dominated by the remanent com¬ 
ponent and its direction is essentially parallel to M r . 
Oceanic basalts, formed by extrusion and rapid underwater 
cooling at oceanic ridges, are an example of rocks with high 
Q n ratios. Due to the rapid quenching of the molten lava, 
titanomagnetite grains form with skeletal structures and 
very fine grain sizes. The oceanic basalts carry a strong 
thermoremanent magnetization and often have Q n values of 
100 or greater. This facilitates the interpretation of oceanic 
magnetic anomalies, because in many cases the induced 
component can be neglected and the crustal magnetization 
can be interpreted as if it were entirely remanent. 

The other important situation is when Q n is very small 
(i.e., Q n < 1). This requires the remanent magnetization to 
be negligible in comparison to the induced magnetization. 
For example, coarse grained magnetite grains carry multi- 
domain magnetizations (Section 5.3.5.3). The domain 
walls are easily moved around by a magnetic field. The sus¬ 
ceptibility is high and the Earth’s magnetic field can induce 
a strong magnetization. Any remanent magnetization is 
usually weak, because it has been subdivided into antipar¬ 
allel domains. These two factors yield a low value for Q n . 
Magnetic investigations of continental crustal rocks for 
commercial exploitation (e.g., in ancient shield areas) can 
often be interpreted as cases with 0 n <^l. The magnetiza¬ 
tion can then be assumed to be entirely induced (Fig. 
5.40c) and oriented parallel to the direction of the present- 
day geomagnetic field at the measurement site, which is 
usually known. This makes it easier to design a model to 
interpret the feature responsible for the magnetic anomaly. 

5.5.2 Magnetometers 

The instrument used to measure magnetic fields is called a 
magnetometer. Until the 1940s magnetometers were 
mechanical instruments that balanced the torque of the 
magnetic field on a finely balanced compass needle against 
a restoring force provided by gravity or by the torsion in a 
suspension fiber. The balance types were cumbersome, 
delicate and slow to operate. For optimum sensitivity they 
were designed to measure changes in a selected compo¬ 
nent of the magnetic field, most commonly the vertical 
field. This type of magnetometer has now been super¬ 
seded by more sensitive, robust electronic instruments. 
The most important of these are the flux-gate, proton- 
precession and optically pumped magnetometers. 
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55.2.1 The flux-gate magnetometer 

Some special nickel-iron alloys have very high magnetic 
susceptibility and very low remanent magnetization. 
Common examples are Permalloy (78.5% Ni, 21.5% Fe) 
and Mumetal (77% Ni, 16% Fe, 5% Cu, 2% Cr). The 
preparation of these alloys involves annealing at very 
high temperature (1100-1200 °C) to remove lattice defects 
around which internal stress could produce magnetostric- 
tive energy. After this treatment the coercivity of the alloy 
is very low (i.e., its magnetization can be changed by a 
very weak field) and its susceptibility is so high that the 
Earth’s field can induce a magnetization in it that is a con¬ 
siderable proportion of the saturation value. 

The sensor of a flux-gate magnetometer consists of 
two parallel strips of the special alloy (Fig. 5.41a). They 
are wound in opposite directions with primary energiz¬ 
ing coils. When a current flows in the primary coils, the 
parallel strips become magnetized in opposite directions. 
A secondary coil wound about the primary pair detects 
the change in magnetic flux in the cores (Fig. 5.41b), 
which is zero as soon as the cores saturate. While the 
primary current is rising or falling, the magnetic flux in 
each strip changes and a voltage is induced in the sec¬ 
ondary coil. If there is no external magnetic field, the 
signals due to the changing flux are equal and opposite 
and no output signal is recorded. When the axis of the 
sensor is aligned with the Earth’s magnetic field, the 
latter is added to the primary field in one strip and sub¬ 
tracted from it in the other. The phases of the magnetic 
flux in the alloy strips are now different; one saturates 
before the other. The flux changes in the two alloy strips 
are no longer equal and opposite. An output voltage is 
produced in the secondary coil that is proportional to 
the strength of the component of the Earth’s magnetic 
field along the axis of the sensor. 

The flux-gate magnetometer is a vector magnetometer, 
because it measures the strength of the magnetic field in a 
particular direction, namely along the axis of the sensor. 
This requires that the sensor be accurately oriented along 
the direction of the field component to be measured. For 
total field measurements three sensors are employed. 
These are fixed at right angles to each other and con¬ 
nected with a feedback system which rotates the entire 
unit so that two of the sensors detect zero field. The mag¬ 
netic field to be measured is then aligned with the axis of 
the third sensor. 

The flux-gate magnetometer does not yield absolute 
field values. The output is a voltage, which must be cali¬ 
brated in terms of magnetic field. However, the instru¬ 
ment provides a continuous record of field strength. Its 
sensitivity of about 1 nT makes it capable of measuring 
most magnetic anomalies of geophysical interest. It is 
robust and adaptable to being mounted in an airplane, or 
towed behind it. The instrument was developed during 
World War II as a submarine detector. After the war it 
was used extensively in airborne magnetic surveying. 


primary secondary 

circuit circuit 



Fig. 5.41 Simplified principle of the flux-gate magnetometer, (a) 

Primary and secondary electrical circuits include coils wrapped around 
parallel strips of Mumetal in opposite and similar senses, respectively. 

(b) The output signal in a magnetic field is proportional to the net rate 
of change of magnetic flux in the Mumetal strips (after Militzer etal., 
1984). 

55.2.2 The proton-precession magnetometer 

Since World War II sensitive magnetometers have been 
designed around quantum-mechanical properties. The 
proton-precession magnetometer depends on the fact 
that the nucleus of the hydrogen atom, a proton, has a 
magnetic moment proportional to the angular momen¬ 
tum of its spin. Because the angular momentum is quan¬ 
tized, the proton magnetic moment can only have 
specified values, which are multiples of a fundamental 
unit called the nuclear magneton. The situation is analo¬ 
gous to the quantization of magnetic moment associated 
with electron spin, for which the fundamental unit is the 
Bohr magneton. The ratio of the magnetic moment to the 
spin angular momentum is called the gyromagnetic ratio 
(y p ) of the proton. It is an accurately known fundamental 
constant with the value y p = 2.675 13 X 10 8 s -1 T -1 . 

The proton-precession magnetometer is simple and 
robust in design. The sensor of the instrument consists of a 
flask containing a proton-rich liquid, such as water. 
Around the flask are wound a magnetizing solenoid and a 
detector coil (Fig. 5.42); some designs use the same sole¬ 
noid alternately for magnetizing and detection. When the 
current in the magnetizing solenoid is switched on, it 
creates a magnetic field of the order of lOmT, which 
is about 200 times stronger than the Earth’s field. The 
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Fig. 5.42 (a) The elements of a proton-precession magnetometer, (b) 
Current in the magnetizing coil produces a strong field Fthat aligns the 
magnetic moments ("spins") of the protons, (c) When the field F is 
switched off, the proton spins precess about the geomagnetic field B v 
inducing an alternating current in the coil with the Larmor precessional 
frequency/. 


magnetizing field aligns the magnetic moments of the 
protons along the axis of the solenoid, which is oriented 
approximately east-west at right angles to the Earth’s field. 
After the magnetizing field is interrupted, the magnetic 
moments of the proton spins react to the couple exerted on 
them by the Earth’s magnetic field. Like a child’s top spin¬ 
ning in the field of gravity, the proton magnetic moments 
precess about the direction of the ambient magnetic field. 
They do so at a rate known as the Larmor precessional fre¬ 
quency. The motion of the magnetic moments induces a 
signal in the detector coil. The induced signal is amplified 
electronically and the precessional frequency is accurately 
measured by counting cycles for a few seconds. The 
strength B t of the measured magnetic field is directly pro¬ 
portional to the frequency of the signal (/), and is given by 

B t = 2 y/ (5.43) 

The intensity of the Earth's magnetic field is in the 
range 30,000-60,000 nT. The corresponding precessional 
frequency is approximately 1250-2500 EIz, which is in the 
audio-frequency range. Accurate measurement of the 


signal frequency gives an instrumental sensitivity of 
about 1 nT, but requires a few seconds of observation. 
Although it gives an absolute value of the field, the 
proton-precession magnetometer does not give a continu¬ 
ous record. Its portability and simplicity give it advan¬ 
tages for field use. 

The flux-gate and proton-precession magnetometers 
are widely used in magnetic surveying. The two instru¬ 
ments have comparable sensitivities of 0.1-1 nT. In 
contrast to the flux-gate instrument, which measures the 
component of the field along its axis, the proton-preces¬ 
sion magnetometer cannot measure field components; it 
is a total-field magnetometer. The total field B t is the 
vector sum of the Earth’s magnetic field B E and the stray 
magnetic field AB of, say, an orebody. Generally, A B<B E , 
so that the direction of the total field does not deviate far 
from the Earth’s field. In some applications it is often ade¬ 
quate to regard the measured total field anomaly as the 
projection of AB along the Earth’s field direction. 

5.5.23 The absorption-cell magnetometer 

The absorption-cell magnetometer is also referred to as 
the alkali-vapor or optically pumped magnetometer. The 
principle of its operation is based on the quantum- 
mechanical model of the atom. According to their 
quantum numbers the electrons of an atom occupy con¬ 
centric shells about the nucleus with different energy 
levels. The lowest energy level of an electron is its ground 
state. The magnetic moment associated with the spin of 
an electron can be either parallel or antiparallel to an 
external magnetic field. The energy of the electron is 
different in each case. This results in the ground state 
splitting into two sublevels with slightly different ener¬ 
gies. The energy difference is proportional to the strength 
of the magnetic field. The splitting of energy levels in the 
presence of a magnetic field is called the Zeeman effect. 

Absorption-cell magnetometers utilize the Zeeman 
effect in vapors of alkali elements such as rubidium or 
cesium, which have only a single valence electron in the out¬ 
ermost energy shell. Consider the schematic representation 
of an alkali-vapor magnetometer in Fig. 5.43. A polarized 
light-beam is passed through an absorption cell containing 
rubidium or cesium vapor and falls on a photoelectric cell, 
which measures the intensity of the light-beam. In the pres¬ 
ence of a magnetic field the ground state of the rubidium or 
cesium is split into two sublevels, and G 2 . If the exact 
amount of energy is added to the vapor, the electrons may 
be raised from their ground state to a higher-energy level, 
H. Suppose that we irradiate the cell with light from which 
we have filtered out the spectral line corresponding to the 
energy needed for the transition G 2 H. The energy for the 
transition GjH has not been removed, so the electrons in 
ground state G t will receive energy that excites them to level 
H, whereas those in ground state G ? will remain in this 
state. The energy for these transitions comes from the inci¬ 
dent light-beam, which is absorbed in the cell. In due 
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Fig. 5.43 The principle of 
operation of the optically 
pumped magnetometer (after 
Telford etal., 1990). 
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course, the excited electrons will fall back to one of the 
more stable ground states. If an electron in excited state H 
falls back to sublevel it will be re-excited into level H; 
but if it falls back to sublevel G-, it will remain there. In 
time, this process - called "optical pumping” - will empty 
sublevel G t and fill level G 2 . At this stage no more energy 
can be absorbed from the polarized light-beam and the 
absorption cell becomes transparent. If we now supply 
electromagnetic energy to the system in the form of a radio¬ 
frequency signal with just the right amount of energy to 
permit transitions between the populated G 2 and unpopu¬ 
lated Gj ground sublevels, the balance will be disturbed. 
The optical pumping will start up again and will continue 
until the electrons have been expelled from the G t level. 
During this time energy is absorbed from the light-beam 
and it ceases to be transparent. 

In the rubidium-vapor and cesium-vapor magnetome¬ 
ters a polarized light-beam is shone at approximately 45° to 
the magnetic field direction. In the presence of the Earth's 
magnetic field the electrons precess about the field direction 
at the Larmor precessional frequency. At one part of the 
precessional cycle an electron spin is almost parallel to the 
field direction, and one half-cycle later it is nearly antiparal¬ 
lel. The varying absorption causes a fluctuation of intensity 
of the light-beam at the Larmor frequency. This is detected 
by the photocell and converted to an alternating current. 
By means of a feedback circuit the signal is supplied to a 
coil around the container of rubidium gas and a radio¬ 
frequency resonant circuit is created. The ambient geomag¬ 
netic field B t that causes the splitting of the ground state is 
proportional to the Larmor frequency, and is given by 

A = yr/ (5.44) 

Here, y e is the gyromagnetic ratio of the electron, which 
is known with an accuracy of about 1 part in 10 7 . It is 


about 1800 times larger than y the gyromagnetic ratio of 
the proton. The precessional frequency is correspondingly 
higher and easier to measure precisely. The sensitivity of 
an optically pumped magnetometer is very high, about 
0.01 nT, which is an order of magnitude more sensitive 
than the flux-gate or proton-precession magnetometer. 

5.5.3 Magnetic surveying 

The purpose of magnetic surveying is to identify and 
describe regions of the Earth’s crust that have unusual 
(anomalous) magnetizations. In the realm of applied geo¬ 
physics the anomalous magnetizations might be associ¬ 
ated with local mineralization that is potentially of 
commercial interest, or they could be due to subsurface 
structures that have a bearing on the location of oil 
deposits. In global geophysics, magnetic surveying over 
oceanic ridges provided vital clues that led to the theory 
of plate tectonics and revealed the polarity history of the 
Earth's magnetic field since the Early Jurassic. 

Magnetic surveying consists of (1) measuring the terres¬ 
trial magnetic field at predetermined points, (2) correcting 
the measurements for known changes, and (3) comparing 
the resultant value of the field with the expected value at 
each measurement station. The expected value of the field 
at any place is taken to be that of the International 
Geomagnetic Reference Field (IGRF), described in Section 
5.4.4. The difference between the observed and expected 
values is a magnetic anomaly. 

5.53.1 Measurement methods 

The surveying of magnetic anomalies can be carried out 
on land, at sea and in the air. In a simple land survey an 
operator might use a portable magnetometer to measure 
the field at the surface of the Earth at selected points that 
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Fig. 5.44 (a) In airborne magnetic surveying the magnetometer may be 
mounted rigidly on the airplane at the end of a boom (A), or towed in 
an aerodynamic housing behind the plane (B). (b) In marine studies the 
magnetometer must be towed some distance d behind the ship to 
escape its magnetic field, (c) A pair of sensitive magnetometers in the 
same vertical plane act as a magnetic gradiometer (after Slack etal., 
1967). 

form a grid over a suspected geological structure. This 
method is slow but it yields a detailed pattern of the mag¬ 
netic field anomaly over the structure, because the mea¬ 
surements are made close to the source of the anomaly. 

In practice, the surveying of magnetic anomalies is most 
efficiently carried out from an aircraft. The magnetometer 
must be removed as far as possible from the magnetic envi¬ 
ronment of the aircraft. This may be achieved by mounting 
the instrument on a fixed boom, A, several meters long 
(Fig. 5.44a). Alternatively, the device may be towed behind 
the aircraft in an aerodynamic housing, B, at the end of a 
cable 30-150 m long. The “bird” containing the magne¬ 
tometer then flies behind and below the aircraft. The flight 
environment is comparatively stable. Airborne magne¬ 
tometers generally have higher sensitivity (=0.01 nT) than 
those used in ground-based surveying (sensitivity = 1 nT). 
This compensates for the loss in resolution due to the 
increased distance between the magnetometer and the 
source of the anomaly. Airborne magnetic surveying is an 


economical way to reconnoitre a large territory in a short 
time. It has become a routine part of the initial phases of 
the geophysical exploration of an uncharted territory. 

The magnetic field over the oceans may also be sur¬ 
veyed from the air. However, most of the marine magnetic 
record has been obtained by shipborne surveying. In the 
marine application a proton-precession magnetometer 
mounted in a waterproof “fish” is towed behind the ship at 
the end of a long cable (Fig. 5.44b). Considering that most 
research vessels consist of several hundred to several thou¬ 
sand tons of steel, the ship causes a large magnetic distur¬ 
bance. For example, a research ship of about 1000 tons 
deadweight causes an anomaly of about 10 nT at a dis¬ 
tance of 150 m. To minimize the disturbance of the ship 
the tow-cable must be about 100-300 m in length. At this 
distance the “fish” in fact “swims” well below the water 
surface. Its depth is dependent on the length of the tow- 
cable and the speed of the ship. At a typical survey speed 
of 10 km h~' its operational depth is about 10-20 m. 

5.53.2 Magnetic gradiometers 

The magnetic gradiometer consists of a pair of alkali-vapor 
magnetometers maintained at a fixed distance from each 
other. In ground-based surveying the instruments are 
mounted at opposite ends of a rigid vertical bar. In airborne 
usage two magnetometers are flown at a vertical spacing of 
about 30 m (Fig. 5.44c). The difference in outputs of the 
two instruments is recorded. If no anomalous body is 
present, both magnetometers register the Earth’s field 
equally strongly and the difference in output signals is zero. 
If a magnetic contrast is present in the subsurface rocks, the 
magnetometer closest to the structure will detect a stronger 
signal than the more remote instrument, and there will be a 
difference in the combined output signals. 

The gradiometer emphasizes anomalies from local 
shallow sources at the expense of large-scale regional 
variation due to deep-seated sources. Moreover, because 
the gradiometer registers the difference in signals from the 
individual magnetometers, there is no need to compen¬ 
sate the measurements for diurnal variation, which affects 
each individual magnetometer equally. Proton-precession 
magnetometers are most commonly used in ground- 
based magnetic gradiometers, while optically pumped 
magnetometers are favored in airborne gradiometers. 

5.5.3.3 The survey pattern 

In a systematic regional airborne (or marine) magnetic 
survey the measurements are usually made according to a 
predetermined pattern. In surveys made with fixed-wing 
aircraft the survey is usually flown at a constant flight ele¬ 
vation above sea-level (Fig. 5.45a). This is the procedure 
favored for regional or national surveys, or for the investi¬ 
gation of areas with dramatic topographic relief. The 
survey focuses on the depth to the magnetic basement, 
which often underlies less magnetic sedimentary surface 
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Fig. 5.45 In airborne magnetic surveying the flight-lines may be flown 
at (a) constant altitude above sea-level, or (b) constant height above 
ground-level. The flight pattern (c) includes parallel measurement lines 
and orthogonal cross-tie lines. 

rocks at considerable depth. In regions that are flat or that 
do not have dramatic topography, it may be possible to fly 
a survey at low altitude, as close as possible to the mag¬ 
netic sources. This method would be suitable over ancient 
shield areas, where the goal of the survey is to detect local 
mineralizations with potential commercial value. If a 
helicopter is being employed, the distance from the mag¬ 
netic sources may be kept as small as possible by flying at 
a constant height above the ground surface (Fig. 5.45b). 

The usual method is to survey a region along parallel 
flight-lines (Fig. 5.45c), which may be spaced anywhere 
from 100 m to a few kilometers apart, depending on the 
flight elevation used, the intensity of coverage, and the 
quality of detail desired. The orientation of the flight- 
lines is selected to be more or less normal to the trend of 
suspected or known subsurface features. Additional tie- 
lines are flown at right angles to the main pattern. Their 
separation is about 5-6 times that of the main flight-lines. 
The repeatability of the measurements at the intersec¬ 
tions of the tie-lines and the main flight-lines provides a 
check on the reliability of the survey. If the differences 
(called closure errors) are large, an area may need to be re¬ 
surveyed. Alternatively, the differences may be distributed 
mathematically among all the observations until the 
closure errors are minimum. 


5.5.4 Reduction of magnetic field measurements 

In comparison to the reduction of gravity data, magnetic 
survey data require very few corrections. One effect that 
must be compensated is the variation in intensity of the 
geomagnetic field at the Earth’s surface during the course 
of a day. As explained in more detail in Section 5.4.3.3 
this diurnal variation is due to the part of the Earth’s mag¬ 
netic field that originates in the ionosphere. At any point 
on the Earth’s surface the external field varies during the 
day as the Earth rotates beneath different parts of the 
ionosphere. The effect is much greater than the precision 
with which the field can be measured. The diurnal varia¬ 
tion may be corrected by installing a constantly recording 
magnetometer at a fixed base station within the survey 
area. Alternatively, the records from a geomagnetic 
observatory may be used, provided it is not too far from 
the survey area. The time is noted at which each field mea¬ 
surement is made during the actual survey and the appro¬ 
priate correction is made from the control record. 

The variations of magnetic field with altitude, latitude 
and longitude are dominated by the vertical and horizontal 
variations of the dipole field. The total intensity 5, of the 
field is obtained by computing the resultant of the radial 
component B r (Eq. (5.38)) and the tangential component 
Bq (Eq. (5.39)): 

, fJtrjn a/ 1 + 3cos 2 0 

B={Bf+Bl = ^ - (5.45) 


The altitude correction is given by the vertical gradient of 
the magnetic field, obtained by differentiating the inten¬ 
sity B t with respect to radius, r. This gives 


dB t n 0 m -\|l + 3cos 2 0 3 
~dr = _3 ^tt 7 4 = 


(5.46) 


The vertical gradient of the field is found by substituting 
r = R = 6371 km and an appropriate value for B v It clearly 
depends on the latitude of the measurement site. At the 
magnetic equator (Z? t = 30,000 nT) the altitude correction 
is about 0.015 nT m _1 ; near the magnetic poles 
(B t ~ 60,000 nT) it is about 0.030 nT m -1 . The correction 
is so small that it is often ignored. 

In regional studies the corrections for latitude and longi¬ 
tude are inherent in the reference field that is subtracted. In 
a survey of a small region, the latitude correction is given 
by the north-south horizontal gradient of the magnetic 
field, obtained by differentiating B t with respect to polar 
angle, 0. This gives for the northward increase in B t (i.e., 
with increasing latitude) 


1 dB, Horn I d rr —^ -^ 35, sind cos 0 

r d0 4-tt r 4 dd^ 1 + 3cos 6 r(l+3cos 2 0) 


(5.47) 


The latitude correction is zero at the magnetic pole (0 = 0°) 
and magnetic equator (0=90°) and reaches a maximum 
value of about 5 nT per kilometer (0.005 nT m -1 ) at inter¬ 
mediate latitudes. It is insignificant in small-scale surveys. 
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In some land-based surveys of highly magnetic terrains 
(e.g., over lava flows or mineralized intrusions), the disturb¬ 
ing effect of the magnetized topography may be serious 
enough to require additional topographic corrections. 

5.5.5 Magnetic anomalies 

The gravity anomaly of a body is caused by the density 
contrast (Ap) between the body and its surroundings. 
The shape of the anomaly is determined by the shape of 
the body and its depth of burial. Similarly, a magnetic 
anomaly originates in the magnetization contrast (A M) 
between rocks with different magnetic properties. 
However, the shape of the anomaly depends not only on 
the shape and depth of the source object but also on its 
orientation to the profile and to the inducing magnetic 
field, which itself varies in intensity and direction with 
geographical location. In oceanic magnetic surveying 
the magnetization contrast results from differences in the 
remanent magnetizations of crustal rocks, for which the 
Konigsberger ratio is much greater than unity (i.e., 
Q n > 1). Commercial geophysical prospecting is carried 
out largely in continental crustal rocks, for which the 
Konigsberger ratio is much less than unity (i.e., Q n < 1) 
and the magnetization may be assumed to be induced by 
the present geomagnetic field. The magnetization con¬ 
trast is then due to susceptibility contrast in the crustal 
rocks. If k represents the susceptibility of an orebody, k Q 
the susceptibility of the host rocks and F the strength of 
the inducing magnetic field, Eq. (5.17) allows us to write 
the magnetization contrast as 

A M=(k-k 0 )F (5.48) 

Some insight into the physical processes that give rise to 
a magnetic anomaly can be obtained from the case of a 
vertically sided body that is magnetized by a vertical mag¬ 
netic field. This is a simplified situation because in practice 
both the body and the field will be inclined, probably at 
different angles. However, it allows us to make a few obser¬ 
vations that are generally applicable. Two scenarios are of 
particular interest. The first is when the body has a large 
vertical extent, such that its bottom surface is at a great 
depth; the other is when the body has a limited vertical 
extent. In both cases the vertical field magnetizes the body 
parallel to its vertical sides, but the resulting anomalies 
have different shapes. To understand the anomaly shapes 
we will use the concept of magnetic pole distributions. 

5.5.5.1 Magnetic anomaly of a surface distribution of 
magnetic poles 

Although magnetic poles are a Active concept (see Section 
5.2.2.1), they provide a simple and convenient way to 
understand the origin of magnetic field anomalies. If a 
slice is made through a uniformly magnetized object, 
simple logic tells us that there will be as many south poles 
per unit of area on one side of the slice as north poles on 


the opposite side; these will cancel each other and the net 
sum of poles per unit area of the surface of the slice is 
zero. This is no longer the case if the magnetization 
changes across the interface. On each unit area of the 
surface there will be more poles of the stronger magneti¬ 
zation than poles of the weaker one. A quantitative 
derivation shows that the resultant number of poles per 
unit area tr (called the surface density of poles) is propor¬ 
tional to the magnetization contrast AM. 

The concept of the solid angle subtended by a surface 
element (Box 5.4) provides a qualitative understanding of 
the magnetic anomaly of a surface distribution of mag¬ 
netic poles. Consider the distribution of poles on the upper 
surface with area A of a vertical prism with magnetization 
Minduced by a vertical field B_, as illustrated in Fig. 5.46a. 
At the surface of the Earth, distant r from the distribution 
of poles, the strength of their anomalous magnetic field is 
proportional to the total number of poles on the surface, 
which is the product of A and the surface density cr of 
poles. Equation (5.2) shows that the intensity of the field of 
a pole decreases as the inverse square of distance r. If the 
direction of r makes an angle 9 with the vertical magnetiza¬ 
tion M, the vertical component of the anomalous field at P 
is found by multiplying by cosd. The vertical magnetic 
anomaly A B_ of the surface distribution of poles is 

A5.0C ((rA ^ os0 oc (AM)a (5.49) 

A more rigorous derivation leads to essentially the 
same result. At any point on a measurement profile, the 
magnetic anomaly A B. of a distribution of poles is pro¬ 
portional to the solid angle II subtended by the distribu¬ 
tion at the point. The solid angle changes progressively 
along a profile (Fig. 5.46b). At the extreme left and right 
ends, the radius from the observation point is very oblique 
to the surface distribution of poles and the subtended 
angles Ilj and Il 4 are very small; the anomaly distant from 
the body is nearly zero. Over the center of the distribution, 
the subtended angle reaches its largest value fl Q and the 
anomaly reaches a maximum. The anomaly falls smoothly 
on each side of its crest corresponding to the values of the 
subtended angles Il 2 and Il 3 at the intermediate positions. 
A measurement profile across an equal distribution of 
“north” poles would be exactly inverted. The north poles 
create a field of repulsion that acts everywhere to oppose 
the Earth’s magnetic field, so the combined field is less 
than it would be if the “north” poles were not there. The 
magnetic anomaly over “north” poles is negative. 

5.5.52 Magnetic anomaly of a vertical dike 

We can now apply these ideas to the magnetic anomaly of 
a vertical dike. In this and all following examples we will 
assume a two-dimensional situation, where the horizon¬ 
tal length of the dike (imagined to be into the page) is 
infinite. This avoids possible complications related to 
“end effects.” Let us first assume that the dike extends to 
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Box 5.4: Solid angles 

A solid angle is defined by the ratio between the area 
of an element of the surface of a sphere and the radius 
of the sphere. Let the area of a surface element be A 
and the radius of the sphere be r, as in Fig. B5.4. The 
solid angle ft subtended by the area A at the center of 
the sphere is defined as 


ft = 4 

r L 


The angle subtended by any surface can be deter¬ 
mined by projecting the surface onto a sphere. The 
shape of the area is immaterial. If an element of area 
A is inclined at angle a to the radius r through a point 
on the surface, its projection normal to the radius (i.e., 
onto a sphere passing through the point) is ^cosa, and 
the solid angle it subtends at the center of the sphere is 
given by 

« Acosa 


A solid angle is measured in units of steradians, 
which are analogous to radians in planar geometry. 
The minimum value of a solid angle is zero, when the 
surface element is infinitesimally small. The maximum 
value of a solid angle is when the surface completely 
surrounds the center of the sphere. The surface area of 
a sphere of radius r is A =4nr 2 and the solid angle 
at its center has the maximum possible value, which 
is 4 tt. 




Fig. B5.4 Definition of the solid angle (l subtended by an area A 
on the surface of a sphere with radius r. 


(a) 



Fig. 5.46 Explanation of the magnetic anomaly of a vertical prism with 
infinite depth extent. For simplicity the magnetization M and inducing 
field B z are both assumed to be vertical, (a) The distribution of magnetic 
poles on the top surface of the prism subtends an angle fl at the point 
of measurement, (b) The magnetic anomaly A B z varies along a profile 
across the prism with the value of the subtended angle ft. 

very great depths (Fig. 5.47a), so that we can ignore the 
small effects associated with its remote lower end. The 
vertical sides of the dike are parallel to the magnetization 
and no magnetic poles are distributed on these faces. 
However, the horizontal top face is normal to the magne¬ 
tization and a distribution of magnetic poles can be 
imagined on this surface. The direction of magnetization 
is parallel to the field, so the pole distribution will consist 
of “south” poles. The magnetized dike behaves like a 
magnetic monopole. At any point above the dike we 
measure both the inducing field and the anomalous “stray 
field” of the dike, which is directed toward its top. The 
anomalous field has a component parallel to the Earth’s 
field and so the total magnetic field will be everywhere 
stronger than if the dike were not present. The magnetic 
anomaly is everywhere positive, increasing from zero 
far from the dike to a maximum value directly over it 
(Fig. 5.46b). 

If the vertical extent of the dike is finite, the distribution 
of north poles on the bottom of the dike may be close 
enough to the ground surface to produce a measurable 
stray field. The upper distribution of south poles causes a 
positive magnetic anomaly, as in the previous example. The 
lower distribution of north poles causes a negative 
anomaly (Fig. 5.47b). The north poles are further from the 
magnetometer than the south poles, so their negative 
anomaly over the dike is weaker. However, farther along 
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Fig. 5.47 (a) The vertical-field 
magnetic anomaly over a 
vertically magnetized block 
with infinite depth extent is 
due only to the distribution of 
poles on the top surface, (b) If 
the block has finite depth 
extent, the pole distributions 
on the top and bottom 
surfaces both contribute to 
the anomaly. 


(a) infinite depth extent: (b) finite depth extent: 

monopole model dipole model 



the profile the deeper distribution of poles subtends a 
larger angle than the upper one does. As a result, the 
strength of the weaker negative anomaly does not fall off 
as rapidly along the profile as the positive anomaly does. 
Beyond a certain lateral distance from the dike (to the left 
of L and to the right of R in Fig. 5.47b) the negative 
anomaly of the lower pole distribution is stronger than the 
positive anomaly of the upper one. This causes the mag¬ 
netic anomaly to have negative side lobes, which asymptot¬ 
ically approach zero with increasing distance from the dike. 

The magnetized dike in this example resembles a bar 
magnet and can be modelled crudely by a dipole. Far from 
the dike, along a lateral profile, the dipole field lines have a 
component opposed to the inducing field, which results in 
the weak negative side lobes of the anomaly. Closer to the 
dike, the dipole field has a component that reinforces the 
inducing field, causing a positive central anomaly. 

5.5.53 Magnetic anomaly of an inclined magnetization 

When an infinitely long dike is magnetized obliquely rather 
than vertically, its anomaly can be modelled either by an 
inclined dipole or by pole distributions (Fig. 5.48). The 
magnetization has both horizontal and vertical compo¬ 
nents, which produce magnetic pole distributions on the 
vertical sides of the dike as well as on its top and bottom. 
The symmetry of the anomaly is changed so that the nega¬ 
tive lobe of the anomaly is enhanced on the side towards 
which the horizontal component of magnetization points; 
the other negative lobe decreases and may disappear. 

The shape of a magnetic anomaly also depends on the 
angle at which the measurement profile crosses the dike, 
and on the strike and dip of the dike. The geometry, mag¬ 
netization and orientation of a body may be taken into 
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Fig. 5.48 Explanation of the origin of the magnetic anomaly of an 
infinitely long vertical prism in terms of the pole distributions on top, 
bottom and side surfaces, when the magnetic field (or magnetization) is 
inclined. 

account in forward-modelling of an anomaly. Flowever, 
as in other potential field methods, the inverse problem of 
determining these factors from the measured anomaly is 
not unique. 
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(a) magnetic anomaly map (b) after reduction to the pole 



Fig. 5.49 Effect of data-processing by reduction to the pole on the 
magnetic anomaly of a small vertical prism with an inclined 
magnetization. In (a) the contour lines define a dipole type of anomaly 
with regions of maximum and minimum intensity (in nT); in (b) the 
anomaly after reduction to the pole is much simpler and constrains 
better the location of the center of the prism (after Lindner etal., 

1984). 

The asymmetry (or skewness) of a magnetic anomaly 
can be compensated by the method of reduction to the pole. 
This consists of recalculating the observed anomaly for the 
case that the magnetization is vertical. The method involves 
sophisticated data-processing beyond the scope of this 
text. The observed anomaly map is first converted to a 
matrix of values at the intersections of a rectangular grid 
overlying the map. The Fourier transform of the matrix is 
then computed and convolved with a filter function to 
correct for the orientations of the body and its magnetiza¬ 
tion. The reduction to the pole removes the asymmetry of 
an anomaly (Fig. 5.49) and allows a better location of the 
margins of the disturbing body. Among other applications, 
the procedure has proved to be important for detailed inter¬ 
pretation of the oceanic crustal magnetizations responsible 
for lineated oceanic magnetic anomalies. 

5.55.4 Magnetic anomalies of simple geometric bodies 

The computation of magnetic anomalies is generally more 
complicated than the computation of gravity anomalies. In 
practice, iterative numerical procedures are used. However, 
the Poisson relation (Box 5.5) enables the computation of 
magnetic anomalies for bodies for which the gravity 
anomaly is known. This is most easily illustrated for verti¬ 
cally magnetized bodies, such as the following examples. 

(1) Sphere. The gravity anomaly A g. over a sphere of 
radius R with density contrast Ap and center at depth 
z (representing a diapir or intrusion) is given by Eq. 
(2.83), repeated here: 


Ag_. 3 irGApR 3 _|_ x 2) 3/2 

Assuming the same dimensions and a magnetiza¬ 
tion contrast AM 7 , the potential of the magnetic 
anomaly over a vertically magnetized sphere accord¬ 
ing to the Poisson relation is 


W= 


Ur, ( A M.\ 1 
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(5.50) 


By differentiating with respect to x or z we get the 
horizontal or vertical field anomaly, respectively. 
The vertical field magnetic anomaly AB_ of the 
sphere is 


A £_ = - 
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(5.52) 


(2) Horizontal cylinder. The gravity anomaly A g, over a 
cylinder of radius R with horizontal axis centered at 
depth z and with density contrast Ap (representing an 
anticline or syncline) is given by Eq. (2.93). If the 
structure is vertically magnetized with magnetization 
contrast AM., Poisson’s relation gives for the mag¬ 
netic potential 


W — 


V o 
4tt 



- 2 p 0 R 2 AM : _ 2 ' 


(5.53) 


The vertical magnetic field anomaly AZF over the 
horizontal cylinder is 

AB z = 2 ^ 0 ^ 2AM z (^ 2 ^ 2)2 (5.54) 

(3) Horizontal crustal block. The gravity anomaly for a 
thin horizontal sheet of thickness t at depth d between 
horizontal positions Xj and x 2 (Fig. 2.54 b), extending 
to infinity normal to the plane of observation, is given 
by Eq. (2.96). Let the width of the block be 2m, and 
let the horizontal position be measured from the mid¬ 
point of the block, so that Xj = x —m and x 2 — x + in. 
Applying Poisson’s relation, we get the magnetic 
potential for a semi-infinite horizontal thin sheet of 
vertically magnetized dipoles, of thickness t at depth z 


( AM z) . PA M -. 
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Box 5.5: Poisson's relation 


Poisson (1781-1840) observed a relationship between the 
gravitational and magnetic potentials of a body, which 
allows a simple method of computing magnetic field 
anomalies if the gravity anomaly of the body is known. 
Consider an arbitrary volume F with homogeneous 
density and vertical magnetization (Fig. B5.5). If the 
density of the body is Ap a small element with volume d F 
has mass (ApdF). The gravitational potential U at a 
point on the surface at a distance r from the element is 


U = -G 


ApdF 

r 


( 1 ) 


The vertical gravity anomaly Ag z of the volume 
element is found by differentiating U with respect to z 




ApdFj 


GApdF^ 



( 2 ) 


If the body is vertically magnetized with uniform 
magnetization AM., the magnetic moment of the 
volume element is (AM. dF). The magnetic moment is 
directed downward as in Fig. B5.5. The radius vector 
from the element to a point on the surface makes an 
angle (tt - 0) with the orientation of the magnetization. 
The magnetic potential W at the point (r, 6) is 



Fig. B5.5 Definition of parameters used in the derivation of Poisson's 
relation. 


Substituting this result in (3) gives 


po AM. d Fcos ( tt — 6) 
W= 4^ f. 


Comparing Eq. (2) and Eq. (5) and eliminating the 
volume A F we get Poisson's relation: 


p o AM.dFcos0 p 0 AM. dF/-\ 
4tt r 2 4-tt r 2 v/ 


(3) 




Note the following relationship 



This derivation for a small element is also valid for an 
extended body as long as the density and magnetization 
are both uniform. 


Assume that the horizontal crustal block is made 
up of layers of thickness t = dz. If the top of the 
block is at depth z, and its base at depth z 2 , the mag¬ 
netic potential of the block is found by integrating 
Eq. (5.55) between z 0 and z v 
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Differentiating with respect to z gives the vertical 
magnetic field anomaly over the block: 
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where the angles a v a 2 , a 3 and a 4 are defined in Fig. 
5.50a. Note that the angles (c^ - a 2 ) and (a 3 - a 4 ) are 
the planar angles subtended at the point of measure¬ 
ment by the top and bottom edges of the verti¬ 
cally magnetized crustal block respectively. This is 
similar to the dependence of magnetic anomalies of 
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Fig. 5.50 The effect of block width on the shape of the magnetic 
anomaly over a vertically magnetized thin crustal block, (a) The block 
has width w = 2m, and its top and bottom surfaces are at depths z, and 
z 2 , respectively. For each of the calculated anomalies in (b), (c) and (d) 
these depths arez, =2.5 km and z 2 = 3 km, and only the width of the 
block is varied. The amplitude of the anomaly is in arbitrary units. 

three-dimensional bodies on solid angles subtended 
by surfaces of the body at the point of observation, 
as illustrated in Fig. 5.46. As was the case for gravity 
surveys, magnetic profiles normal to the strike of 
elongate bodies may be regarded as two-dimen¬ 
sional, as long as the third dimension of the body is 
large enough for variations normal to the profile to 
be negligible. 

55.5.5 Effect of block width on anomaly shape 

The effect of the width of a crustal block on anomaly 
shape is illustrated by use of Eq. (5.59) to model the verti¬ 
cal field magnetic anomaly of a vertically magnetized 
block with its top at depth 2.5 km and base at depth 3 km. 
The block is effectively a thin magnetized layer, similar to 
the source of oceanic magnetic anomalies. Three cases are 
considered here: a narrow block of width \v = (2m) = 5 km 


for which mlz l = 1, a block of width 10 km (mlz 1 = 2), and 
a wide block of width 40 km (mlz x = 8). 

The narrowest block gives a sharp, positive central 
anomaly with negative side lobes (Fig. 5.50b), as 
explained in Section 5.5.5.2. As the block widens with 
respect to its depth, the top of the central anomaly flat¬ 
tens (Fig. 5.50c), its amplitude over the middle of the 
block decreases, and the negative side lobes grow. When 
the block is much wider than the depth to its top (Fig. 
5.50d), a dip develops over the center of the block. The 
positive anomalies are steep sided and are maximum just 
within the edges of the block, while the null values occur 
close to the edges of the block. The negative side anom¬ 
alies are almost as large as the positive anomalies. 

The pronounced central dip in the anomaly is due to 
the limited vertical thickness of the layer. If the layer is 
very wide relative to its thickness, the central anomaly 
may diminish almost to zero. This is because the angle 
(n 3 ~ a 4 ) subtended by the magnetized base of the layer is 
almost (but not quite) as large as the angle (a x - a 7 ) sub¬ 
tended by the top of the layer. For a very large width-to- 
thickness ratio, the central anomaly is zero, the edge 
anomalies separate and become equivalent to separate 
anomalies over the edges of the block. 

Examination of Fig. 5.50a shows that the subtended 
angles (a l - a 2 ) and (« 3 - n 4 ), and thus the anomaly 
shape, depend also on the height of the measurement 
profile above the surface of the block. A low-altitude 
profile over the block will show a large central dip, while a 
high-altitude profile over the same block will show a 
smaller dip or none at all. 

In contrast to the example of a thin layer described 
above, if the crustal block is very thick, extending to great 
depth, the angle (a 3 - a 4 ) is zero and the effects of the 
magnetization discontinuity (or pole distribution) on its 
base are absent. The shape of the anomaly is then deter¬ 
mined by (ctj - a 2 ) and is flat topped over a wide block. 

5.5.6 Oceanic magnetic anomalies 

In the late 1950s marine geophysicists conducting mag¬ 
netic surveys of the Pacific ocean basin off the west coast 
of North America discovered that large areas of oceanic 
crust are characterized by long stripes of alternating posi¬ 
tive and negative magnetic anomalies. The striped pattern 
is best known from studies carried out across oceanic 
ridge systems (see Fig. 1.13). The striped anomalies are 
hundreds of kilometers in length parallel to the ridge axis, 
10-50 km in width, and their amplitudes amount to 
several hundreds of nanotesla. On magnetic profiles per¬ 
pendicular to a ridge axis the anomaly pattern is found to 
exhibit a remarkable symmetry about the axis of the 
ridge. The origin of the symmetric lineated anomaly 
pattern cannot be explained by conventional methods of 
interpretation based on susceptibility contrast. 

Seismic studies indicate a layered structure for the 
oceanic crust. The floor of the ocean lies at water depths 
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Fig. 5.51 Explanation of the 
shape of a magnetic profile 
across an oceanic spreading 
center: (a) the anomalies of 
individual oppositely 
magnetized crustal blocks on 
one side of the ridge, (b) 
overlap of the individual 
anomalies, (c) the effect for 
the opposite sequence of 
blocks on the other side of 
the ridge, and (d) the 
complete anomaly profile. 



of 2-5 km, and is underlain by a layer of sediment of 
variable thickness, called seismic Layer 1. Under the sedi¬ 
ments lie a complex of basaltic extrusions and shallow 
intrusions, about 0.5 km thick, forming seismic Layer 2A, 
under which are found the deeper layers of the oceanic 
crust consisting of a complex of sheeted dikes (Layer 2B) 
and gabbro (Layer 3). The magnetic properties of these 
rocks were first obtained by studying samples dredged 
from exposed crests and ridges of submarine topography. 
The rocks of Layers 2B and 3 are much less magnetic than 
those of Layer 2A. Samples of pillow basalt dredged near 
to oceanic ridges have been found to have moderate sus¬ 
ceptibilities for igneous rocks, but their remanent magne¬ 
tizations are intense. Their Konigsberger ratios are 
commonly in the range 5-50 and frequently exceed 100. 
Recognition of these properties provided the key to 
understanding the origin of the lineated magnetic anom¬ 
alies. In 1963 the English geophysicists F. J. Vine and D. 
H. Matthews proposed that the remanent magnetizations 
(and not the susceptibility contrast) of oceanic basaltic 
Layer 2 were responsible for the striking lineated anomaly 
pattern. This hypothesis soon became integrated into a 
working model for understanding the mechanism of sea¬ 
floor spreading (see Section 1.2.5 and Fig. 1.14). 

The oceanic crust formed at a spreading ridge acquires 
a thermoremanent magnetization (TRM) in the geomag¬ 
netic field. The basalts in Layer 2A are sufficiently 
strongly magnetized to account for most of the anomaly 
measured at the ocean surface. For a lengthy period of 
time (measuring several tens of thousands to millions of 
years) the polarity of the field remains constant; crust 
formed during this time carries the same polarity as the 
field. After a polarity reversal, freshly formed basalts 
acquire a TRM parallel to the new field direction, i.e., 
opposite to the previous TRM. Adjacent oceanic crustal 


blocks of different widths, determined by the variable 
time between reversals, carry antiparallel remanent mag¬ 
netizations. 

The oceanic crust is magnetized in long blocks parallel 
to the spreading axis, so the anomaly calculated for a 
profile perpendicular to the axis is two dimensional, as in 
the previous examples. Consider the case where the 
anomalies on a profile have been reduced to the pole, so 
that their magnetizations can be taken to be vertical. We 
can apply the concept of magnetic pole distributions to 
each block individually to determine the shape of its mag¬ 
netic anomaly (Fig. 5.51a). If the blocks are contiguous, 
as is the case when they form by a continuous process 
such as sea-floor spreading, their individual anomalies 
will overlap (Fig. 5.51b). The spreading process is sym¬ 
metric with respect to the ridge axis, so a mirror image of 
the sequence of polarized blocks is formed on the other 
side of the axis (Fig. 5.51c). If the two sets of crustal 
blocks are brought together at the spreading axis, a mag¬ 
netic anomaly sequence ensues that exhibits a symmetric 
pattern with respect to the ridge axis (Fig. 5.5Id). 

This description of the origin of oceanic magnetic 
anomalies is over-simplified, because the crustal magneti¬ 
zation is more complicated than assumed in the block 
model. For example, the direction of the remanent magne¬ 
tization, acquired at the time of formation of the ocean 
crust, is generally not the same as the direction of the mag¬ 
netization induced by the present-day field. However, the 
induced magnetization has uniformly the same direction in 
the magnetized layer, which thus behaves like a uniformly 
magnetized thin horizontal sheet and does not contribute 
to the magnetic anomaly. Moreover, oceanic rocks have 
high Konigsberger ratios, and so the induced magnetiza¬ 
tion component is usually negligible in comparison to the 
remanent magnetization. An exception is when a magnetic 



334 Geomagnetism and paleomagnetism 


survey is made close to the magnetized basalt layer, in 
which case a topographic correction may be needed. 

Unless the strike of a ridge is north-south, the magne¬ 
tization inclination must be taken into account. Skewness 
is corrected by reducing the magnetic anomaly profile to 
the pole (Section 5.5.5.3). A possible complication may 
arise if the oceanic magnetic anomalies have two sources. 
The strongest anomaly source is doubtless basaltic Layer 
2B, but, at least in some cases, an appreciable part of the 
anomaly may arise in the deeper gabbroic Layer 3. The 
two contributions are slightly out of phase spatially, 
because of the curved depth profiles of cooling isotherms 
in the oceanic crust. This causes a magnetized block in the 
deeper gabbroic layer to lie slightly further from the ridge 
than the corresponding block with the same polarity in 
the basaltic layer above it. The net effect is an asymmetry 
of inclined magnetization directions on opposite sides of 
a ridge, so that the magnetic anomalies over blocks of the 
same age have different skewnesses. 


5.6 PALEOMAGNETISM 

5.6.1 Introduction 

A mountain walker using a compass to find his way in the 
Swiss Alps above the high mountain valley of the 
Engadine would notice that in certain regions (for 
example, south of the Septimer Pass) the compass-needle 
shows very large deviations from the north direction. The 
deflection is due to the local presence of strongly magne¬ 
tized serpentinites and ultramafic rocks. Early compasses 
were more primitive than modern versions, but the falsifi¬ 
cation of a compass direction near strongly magnetic out¬ 
crops was known by at least the early nineteenth century. 
In 1797 Alexander von Humboldt proposed that the 
rocks in these unusual outcrops had been magnetized by 
lightning strikes. The first systematic observations of rock 
magnetic properties are usually attributed to A. Delesse 
(1849) and M. Melloni (1853), who concluded that vol¬ 
canic rocks acquired a remanent magnetization during 
cooling. After a more extensive series of studies in 1894 
and 1895 of the origin of magnetism in lavas, G. 
Folgerhaiter reached the same conclusion and suggested 
that the direction of remanent magnetization was that of 
the geomagnetic field during cooling. By 1899 he had 
extended his work to the record of the secular variation of 
inclination in ancient potteries. Folgerhaiter noted that 
some rocks have a remanent magnetization opposite to 
the direction of the present-day field. Reversals of polar¬ 
ity of the geomagnetic field were established decisively 
early in the twentieth century. 

In 1922 Alfred Wegener proposed his concept of conti¬ 
nental drift, based on years of study of paleoclimatic indi¬ 
cators such as the geographic distribution of coal 
deposits. At the time, there was no way of explaining 
the mechanism by which the continents drifted. Only 
motions of the crust were considered, and the idea of rigid 


continents ploughing through rigid oceanic crust was 
unacceptable to geophysicists. There was as yet no way to 
reconstruct the positions of the continents in earlier eras 
or to trace their relative motions. Subsequently, paleomag¬ 
netism was to make important contributions to under¬ 
standing continental drift by providing the means to trace 
past continental motions quantitatively. 

A major impetus to these studies was the invention of a 
very sensitive astatic magnetometer. The apparatus con¬ 
sists of two identical small magnets mounted horizontally 
at opposite ends of a short rigid vertical bar so that the 
magnets are oriented exactly antiparallel to each other. 
The assembly is suspended on an elastic fiber. In this con¬ 
figuration the Earth’s magnetic field has equal and oppo¬ 
site effects on each magnet. If a magnetized rock is brought 
close to one magnet, the magnetic field of the rock pro¬ 
duces a stronger twisting effect on the closer magnet than 
on the distant one and the assembly rotates to a new posi¬ 
tion of equilibrium. The rotation is detected by a light 
beam reflected off a small mirror mounted on the rigid bar. 
The device was introduced in 1952 by R M. S. Blackett to 
test a theory that related the geomagnetic field to the 
Earth’s rotation. The experiment did not support the pos¬ 
tulated effect. However, the astatic magnetometer became 
the basic tool of paleomagnetism and fostered its develop¬ 
ment as a scientific discipline. Hitherto it had only been 
possible to measure magnetizations of strongly magnetic 
rocks. The astatic magnetometer enabled the accurate mea¬ 
surement of weak remanent magnetizations in rocks that 
previously had been unmeasurable. 

In the 1950s, several small research groups were 
engaged in determining and interpreting the directions of 
magnetization of rocks of different ages in Europe, Africa, 
North and South America and Australia. In 1956 S. K. 
Runcorn put forward the first clear geophysical evidence 
in support of continental drift. Runcorn compared the 
directions of magnetization of Permian and Triassic rocks 
from Great Britain and North America. He found that the 
paleomagnetic results from the different continents could 
be brought into harmony for the time before 200 Ma ago 
by closing the Atlantic ocean. The evaluation of the scien¬ 
tific data was statistical and at first was regarded as con¬ 
troversial. However, Mesozoic paleomagnetic data were 
soon obtained from the southern hemisphere that also 
argued strongly in favor of the continental drift hypothe¬ 
sis. In 1957 E. Irving showed that paleomagnetic data con¬ 
formed better with geological reconstructions of earlier 
positions of the continents than with their present-day 
distribution. Subsequently, numerous studies have docu¬ 
mented the importance of paleomagnetism as a chronicle 
of past motions of global plates and as a record of the 
polarity history of the Earth’s magnetic field. 

5.6.2 The time-averaged geomagnetic field 

A fundamental assumption of paleomagnetism is that the 
time-averaged geomagnetic field corresponds to that of 
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an axial geocentric dipole. The data in support of this 
important hypothesis come partly from studies of secular 
variation and partly from paleomagnetic observations in 
young rocks and sediments. 

The dipole and non-dipole parts of the historic geo¬ 
magnetic field are known to change slowly with time. 
Spherical harmonic analysis of the geomagnetic field (see 
Section 5.4.5.1) shows that the axis of the geocentric 
inclined dipole has drifted slowly westward at about 
0.044-0.14° yr^ 1 in the last 400 yr (see Fig. 5.36). This 
would correspond to a complete circuit about the rotation 
axis in 2500-8000 yr. The rates of westward drift of the 
historic non-dipole field are around 0.22-0.66° yr -1 (see 
Fig. 5.37), giving a periodicity of 550-1650 yr. However, 
it is important to keep in mind that the historic records of 
the secular variation of the geomagnetic field cover only a 
fragment of a complete circuit, which is not enough to 
confirm cyclicity or to estimate a period. The record of 
earlier magnetic field intensity and direction must be 
inferred from archeomagnetism. 

5.6.2.1 Archeomagnetic records of secular variation 

Paleomagnetism is the study of the geomagnetic field 
recorded in rock magnetizations; archeomagnetism is the 
study of the geomagnetic field recorded in dateable his¬ 
toric artefacts. The age of an archeological relict, such as 
a pot or vase, can often be determined with reliable preci¬ 
sion. The pot, and the oven in which it was fired, may 
carry a thermoremanent magnetization (TRM) acquired 
during cooling. The direction of the TRM can be mea¬ 
sured easily and, if the attitude of the pot during firing is 
known or can be assumed, the inclination of the ancient 
magnetic field in which the artefact was made can be 
deduced. The same considerations apply to lava flows that 
can be dated from historic records. 

The secular record of paleoinclination during the past 
2000 yr is available for two regions in which many archeo- 
magnetic studies have been carried out: southeastern 
Europe and southwestern Japan. These regions are 110° 
apart in longitude but lie in similar latitude ranges, 
35^10°N. Smoothed curves through the observations 
show pseudo-cyclical changes with several maxima and 
minima (Fig. 5.52). The shapes of the curves are not dis¬ 
tinctive, so correlation of individual extreme values is 
risky; however, comparison of the four numbered 
maxima and minima in the last 1400 yr suggests that the 
extreme values appear to occur about 400 yr earlier in 
Japan than in Europe. The equivalent period for a full 
circuit of the globe is 1300 yr. The pseudo-cyclicity is 
interpreted as the effect of westward drift of foci of the 
non-dipole field past the sampling site. More detailed 
analysis of the archeomagnetic data shows that the drift 
rates vary with the latitude of the observation site (see 
Fig. 5.37). The mean drift rate is 0.38° longitude per year, 
which is faster than the rate deduced from recent secular 
variation. 



Fig. 5.52 Secular variation of geomagnetic inclination from 
archeomagnetic studies in southeastern Europe and southwestern 
Japan (after Merrill and McElhinny, 1983). 

A subtle magnetic technique devised by Thellier in 
1937 permits determination of the intensity of the mag¬ 
netic field in which an object acquires a TRM. If it is 
assumed that the field was a dipole field, the strength of 
the dipole magnetic moment can be inferred. When 
applied to rocks or ancient artifacts, this type of analysis 
is called a paleointensity determination. The variation in 
strength of the geomagnetic field during the past 7000 yr 
is shown by the results of 3188 paleointensity measure¬ 
ments on dated archeological and geological samples 
(Fig. 5.53). Older paleointensity data exist, but their 
number is too small to be included without distorting the 
record. The number of artifacts decreases as one goes 
back in time, so to obtain significant mean values the data 
are grouped in 1000 yr intervals before 2000 BC and in 
500 yr intervals after 2000 BC. This introduces a problem, 
because these time intervals are too short for the geomag¬ 
netic field to average to a dipole field. The archeological 
relict records the total field at the time of cooling, which 
contains a substantial non-dipole component. However, 
by combining the 3188 paleointensity data with 13,080 
inclination and 16,085 declination data from lake sedi¬ 
ments, a global field model has been calculated with 
Schmidt coefficients (Eq. (5.34)) up to order and degree 
10. The coefficients with n = 1 from this model give the 
dipole magnetic moment (Eq. (5.41)). The analysis yields 
a smooth continuous record of the variation of the 
strength of the geomagnetic dipole during the past 
7000yr (Fig. 5.53). This record is displaced to slightly 
lower values than the direct measurements of paleointen¬ 
sity represented by the boxes, because it is free of non¬ 
dipole components. The dipole moment has fluctuated in 
the past 7000 years, but the record is too short to establish 
whether the changes are cyclical. 
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Fig. 5.53 Secular variation of 
the virtual geocentric dipole 
moment. The mean 
paleointensity of each time 
interval is plotted at the mid¬ 
point of each box, whose 
height represents the 95% 
confidence limits of the mean. 
The numbers indicate how 
many data were averaged in 
each time interval. The 
continuous curve is obtained 
from a global spherical 
harmonic analysis and the 
results are smoothed with a 
spline function (after Korte 
and Constable, 2005). 



Year 


Westward drift of the dipole field implies systematic 
changes in the equatorial components of the dipole. If the 
changes are approximately cyclical, the mean long-term 
strength of the equatorial dipole measured at any given 
site would average to zero within a few multiples of 10 4 yr. 
Similarly, if the secular variation of the non-dipole field 
can be assumed to be roughly periodic its mean value 
should average to zero within a few multiples of 10 3 yr. 
According to these arguments, the only long-term com¬ 
ponent of the geomagnetic field that persists and is not 
averaged to zero within a few tens of thousands of years 
is the axial dipole component. The long-term equivalence 
of the Earth’s magnetic field with that of a dipole located 
at the center of the Earth and oriented along the rotation 
axis is a fundamental tenet of paleomagnetism; it is called 
the axial geocentric dipole hypothesis. 

5.62.2 The axial geocentric dipole hypothesis 

The evidence in support of the axial geocentric dipole 
hypothesis conies from paleomagnetic studies in modern 
deep-sea sediments and young igneous and sedimentary 
rocks. Pelagic sediments are deposited extremely slowly in 
the deep ocean basins. Sedimentation rates of 1-10 
m Ma~' are common. The sediments acquire a post- 
depositional remanent magnetization (pDRM), which is 
an accurate record of the depositional field direction. The 
slow deposition of deep-sea sediments ensures thorough 
averaging of the magnetic field recorded. For example, at 
pelagic sedimentation rates a typical one-inch thick 
sample of deep-sea sediment averages paleomagnetic 
directions acquired during 2500-25,000 yr of deposition. 
The test of the axial geocentric dipole hypothesis in 


modern deep-sea sediments consists of comparing the 
inclination observed in sediment samples with the inclina¬ 
tion expected for the latitude of the site where the sedi¬ 
ment was sampled. The relationship between field 
inclination /, magnetic co-latitude 6 and latitude A was 
developed in Eq. (5.40) and is shown in Fig. 5.54a. The 
mean inclinations of remanent magnetization were mea¬ 
sured in 52 deep-sea sediment cores of Plio-Pleistocene 
age taken from sites at different latitudes in the northern 
and southern hemispheres. The observed inclinations 
agree well with the values predicted by the theoretical 
curve for the axial geocentric dipole hypothesis (Fig. 
5.54b). 

Assuming the direction of the magnetic field recorded 
at a given site to be that of a dipole field, it is possible to 
calculate where the geomagnetic pole would need to be in 
order to produce the observed declination and inclina¬ 
tion. This location is called the virtual geomagnetic pole 
(VGP) position. It is useful in computing where the pole 
lay in ancient times, the so-called paleomagnetic pole. The 
difference between a VGP and a paleomagnetic pole is 
illustrated by the following example for a recently 
extruded lava. Each sample from the lava formed in a 
short interval of time, and the field direction it records 
will be that of the total geomagnetic field at the site, com¬ 
bining axial dipole, non-dipole and non-axial dipole com¬ 
ponents. The VGP will therefore not coincide with the 
rotation axis. If data are collected from several flows of 
different ages, each will carry a slightly different record of 
the field. The computed VGP position will be different 
from flow to flow, and so the distribution of VGP will be 
scattered. If samples are measured from a large number 
of recent lava flows covering a long enough period of time 
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Fig. 5.54 (a) The geocentric axial dipole hypothesis predicts the 
relationship tan/ = 2tanA between the inclination / of a dipole field and 
the magnetic latitude A. (b) The inclinations measured in modern deep- 
sea sediment cores agree well with the theoretical curve (based on data 
from Schneider and Kent, 1990). 


to average the non-dipole and non-axial dipole parts to 
zero, the mean direction of the collection will correspond 
to the field of an axial geocentric dipole. The pole posi¬ 
tion calculated from the mean direction of the collection 
of flows will agree with the rotation axis. This pole, repre¬ 
senting an averaged value of the field, is called a paleo- 
magnetic pole. The VGP represents a spot estimate of the 
field, including non-axial dipole components; the paleo- 
magnetic pole represents an averaged field, corresponding 
to the axial geocentric dipole. 

The paleomagnetic pole positions determined in 
studies of Plio-Pleistocene to Recent volcanic and sedi¬ 
mentary rocks covering the past 5 Ma lend further 
support to the axial geocentric dipole hypothesis. The dis¬ 
tribution of paleomagnetic poles is clustered around the 
geographic pole and not around the present-day geomag¬ 
netic pole (Fig. 5.55). Statistical analysis shows that the 
mean of the paleomagnetic poles does not differ signifi¬ 
cantly from the geographic pole. 


180° 



® Plio-Pleistocene to Recent paleomagnetic poles 
(younger than 5 Ma) 

^ Present-day geomagnetic pole 

Fig. 5.55 Paleomagnetic pole positions for rocks of Plio-Pleistocene to 
Recent age (after McElhinny, 1973). 


The axial geocentric dipole hypothesis maintains that, 
if data are averaged over a long enough interval of time, 
the mean paleomagnetic pole position will coincide with 
the axis of rotation of the Earth. In fact, detailed analysis 
of Late Tertiary paleomagnetic poles has shown that this 
hypothesis does not hold exactly. This is because the 
mean pole position calculated for any field that is sym¬ 
metric about the rotation axis will lie on the axis, provided 
the directions are obtained at sites covering a wide range 
of longitudes. When young paleomagnetic data of the 
same age are averaged for a particular region, they give a 
paleomagnetic pole position on the far side of the 
present-day rotation axis. This “far-sidedness” of paleo- 
magnetic directions is caused by the presence of a small 
axial geocentric quadrupole, amounting to a few percent 
of the axial geocentric dipole. The superposition of the 
axial dipole and quadrupole is equivalent to displacing 
the center of the dipole about 300 km northward along 
the rotation axis away from the center of the Earth. This 
is a second-order effect; to a first approximation the time- 
averaged paleomagnetic field may be considered to be 
that of an axial geocentric dipole, and the paleomagnetic 
pole lies within a few degrees of the rotational pole. 

5.6.3 Methods of paleomagnetism 

The requirement that the mean paleomagnetic pole posi¬ 
tion derived for a collection of rocks should represent the 
axial geocentric dipole is taken into account in the 
methodology of paleomagnetic analysis. This begins with 
the sampling of a rock formation on a hierarchical 
scheme designed to eliminate or minimize non-systematic 
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errors and to average out the effects of secular variation 
of the paleomagnetic field. At each hierarchical level, 
averaging and statistical analysis are carried out on the 
remanent magnetization vectors. Ideally, a paleomagnetic 
collection should contain a large number of samples per 
site. In practice, about 6-10 samples are enough to define 
the mean direction for a site; the mean values of typically 
10-20 sites from the same formation are averaged to get a 
mean paleomagnetic direction for a formation or region. 

A further assumption of paleomagnetism is that the 
natural remanent magnetization (NRM) of a rock was 
acquired at the time of formation of the rock (or at a 
known time in its history), and has since remained unal¬ 
tered. In fact, the NRM is usually made up of several 
components acquired at different times, including during 
the procedures of sampling and preparation. Laboratory 
techniques must be applied that eliminate the undesirable 
components and isolate the primary magnetization. This 
process is loosely called “magnetic cleaning.” 

The presentation of paleomagnetic directions measured 
in rock samples is made with the help of stereographic pro¬ 
jection. This is a way of plotting three-dimensional direc¬ 
tions by projecting them onto a plane. These plots have 
already been encountered in the analysis of first-motion 
studies of earthquakes (see Section 3.5.4.2). A direction is 
identified by the point where it intersects a unit sphere cen¬ 
tered at the observation site. This converts a set of direc¬ 
tions to a set of points on the surface of a sphere. The 
intersection point is then projected onto the horizontal 
plane to give a stereographic plot. This can be done in 
different ways. The Lambert equal-area projection is 
usually preferred in paleomagnetism as it avoids visually 
distorting the dispersion of directions. In geology, all 
directions are plotted on a stereogram as projections on 
the lower hemisphere. In paleomagnetic stereograms direc¬ 
tions with positive (downward) inclinations are plotted as 
lower hemisphere projections; directions with negative 
(upward) inclinations are plotted with a different symbol 
as upper hemisphere projections. 

5.6.3.1 Measurement of remanent magnetization 

Measurements of the natural remanent magnetization of 
rocks with an astatic magnetometer were laborious and 
time consuming and the instrument has now fallen into 
disuse. In modern paleomagnetic laboratories more 
efficient spinner magnetometers and cryogenic magne¬ 
tometers are in common use. 

Spinner magnetometers originally consisted of a large 
sensor coil containing many turns of wire in which an 
alternating signal was induced by rotating the sample at 
high frequency (around 100 Hz) within the coil. Rapid 
rotation was needed because the voltage induced was pro¬ 
portional to the rate of change of flux in the coil. After 
phase-lock detection and electronic amplification of the 
signal, the calibrated output yielded two components of 
remanence in the plane normal to the rotational axis. The 


instrument was susceptible to electrostatic build-up but 
was capable of measuring magnetizations of around 1(L 3 
Am -1 in 10-15 minutes. 

The flux-gate spinner magnetometer is a subsequent 
refinement in which the sensor coil is replaced with flux- 
gate sensors. These detect directly the external magnetic 
fields of the sample. The signal strength is not dependent 
on rotational speed which could be reduced to about 5-10 
Hz. The rotation of the sample gives a sinusoidal output. 
A large number of cycles can be averaged to reduce noise. 
The output is commonly digitized and stored in memory 
in a small on-line computer. The components of magneti¬ 
zation in the plane normal to the rotational axis are then 
determined by Fourier analysis. A computer-controlled 
flux-gate spinner magnetometer is capable of measuring a 
rock magnetization around 5X 10 -5 A m” 1 in standard 
samples under optimum conditions. The complete mea¬ 
surement of a sample takes only a few minutes. 

The cryogenic magnetometer is the most sensitive and 
rapid instrument in current use. Its sensor consists of a 
coil immersed in liquid helium. At this temperature (4 K) 
the coil is superconducting. A small change of magnetic 
field induces a comparatively large current, which because 
of the superconducting condition is persistent until the 
sample is removed. In line with the coil is a Josephson 
junction, which is a quantum-mechanical device consist¬ 
ing of a very thin element that allows the passage of 
current in distinct units proportional to a quantum of 
magnetic flux. By counting the number of flux jumps elec¬ 
tronically, the external magnetic field of the rock specimen 
can be inferred, and from this its magnetization com¬ 
puted. Most cryogenic magnetometers contain orthogo¬ 
nal sets of coils and can measure two or three axes of 
magnetization simultaneously within a few seconds. The 
sensitivity of the instrument corresponds to a rock magne¬ 
tization of 5 X 10 -6 A m _1 in standard samples. 

5.6.32 Stepwise progressive demagnetization 

The natural remanent magnetization (NRM) of a rock 
may contain several components, some related to the geo¬ 
logical history of the rock and others to the sampling and 
handling procedures. It is necessary to “magnetically 
clean” the natural magnetization so that the structure of 
the NRM can be analyzed and stable components iso¬ 
lated. This is done in a stepwise procedure, in which pro¬ 
gressively more and more of the original magnetization is 
removed. There are two main methods of doing this. 

The first method is progressive alternating field (AF) 
demagnetization. An alternating magnetic field can be pro¬ 
duced in a coil by passing an alternating current through 
it. The field fluctuates between equal and opposite peak 
values. When a rock sample is placed in the alternating 
magnetic field, the grain magnetic moments with coercivi- 
ties less than the peak value of the field are remagnetized 
in a new direction; the field cannot affect a magnetization 
component with coercivity higher than the peak field. The 
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intensity of the alternating field is reduced slowly and uni¬ 
formly to zero. This randomizes the part of the rock mag¬ 
netization that has coercivities less than the peak value of 
the alternating magnetic field. The AF demagnetizing coil 
must be surrounded by magnetic shields or special addi¬ 
tional coils to cancel out the Earth’s magnetic field; other¬ 
wise an anhysteretic remanent magnetization (ARM) is 
induced along the direction of this field. 

The part of the remanence that remains after a demag¬ 
netization treatment has been “magnetically cleaned.” 
The direction and intensity of the remanent magnetiza¬ 
tion are affected. The demagnetization procedure is 
repeated using successively higher values of the peak alter¬ 
nating field, remeasuring the remaining magnetization 
after each step, until the magnetization is reduced to zero. 
Suppose that the NRM of a sample consists of two com¬ 
ponents AB and BC with different directions and different 
coercivity spectra (Fig. 5.56a, b). In the early stages of 
progressive demagnetization (steps 1-3) the “soft” compo¬ 
nent BC is first reduced to zero. The vector measured after 
each step in the progressive demagnetization is the sum of 
the “hard” component, which has not yet been affected by 
the field used, and the residual part of the soft component. 
If the direction of the soft component is within 90° of the 
hard component the intensity decreases during this 
demagnetization interval; otherwise it may increase (Fig. 
5.56c). The direction of the resultant vector changes con¬ 
tinually in steps 1-3 (Fig. 5.56d). After removal of the soft 
component in step 3, higher alternating fields (steps 4-7) 
progressively reduce the hard component AB. During this 
stage the intensity decreases constantly but the direction 
remains consistent with little scatter, defining the “stable 
end-point” direction of the magnetization. 

The effectiveness of the AF demagnetization method 
is limited by the strongest peak field that can be produced 
in the demagnetizing coil. This is commonly 0.1 T, 
although some equipment can reach around 0.3 T. These 
peak fields are well below the maximum coercivity of 
pyrrhotite and far below the coercivities of hematite or 
goethite. Thus AF demagnetization is not effective in 
demagnetizing components carried by these minerals. 
The method is most commonly used for rocks that 
contain magnetite as the main magnetic mineral. 

An alternative method of “magnetic cleaning” is pro¬ 
gressive thermal demagnetization. When a rock sample is 
heated to a given temperature T, magnetic components 
that have lower blocking temperatures than T are thermally 
randomized. If the sample is now cooled in field-free space, 
this part of the NRM remains demagnetized. In stepwise 
thermal demagnetization the heating and cooling cycle is 
repeated with progressively higher maximum temperatures. 
The progressive destruction of the magnetization reveals 
the components present in the NRM as in Fig. 5.56. This 
method is often more effective than AF demagnetization, 
because it is only necessary to heat a sample above the 
highest Curie temperature of its constituent minerals to 
destroy all of the NRM. However, if the rock contains 





Demagnetization step 


Fig. 5.56 Stepwise demagnetization of a remanent magnetization 
consisting of two components with different ranges of stability: (a) low 
stability vector BC demagnetizes before stable stable vector AB; (b) 
same situation with different angle between AB and BC; (c) variation of 
intensity for cases (a) and (b); and (d) directional changes on a 
stereogram. Numbers on points indicate successive demagnetization 
temperatures in °C. 

thermally unstable magnetic minerals, irreversible changes 
may complicate the thermal demagnetization method. 

5.63.3 Analysis of magnetization components 

The stability of a remanent magnetization during step¬ 
wise demagnetization can be demonstrated by plotting 
the remaining intensity after each step against the corre¬ 
sponding temperature or AF field, as in Fig. 5.56c; the 
directional stability can be controlled simultaneously by 
plotting the direction after each step on a stereogram 
(Fig. 5.56d). However, analysis of magnetization with an 
intensity plot and stereogram is outmoded. More sophis¬ 
ticated methods treat the magnetization as a vector and 
analyze the stability of its individual components. 

The most powerful method of analysis of the structure 
and stability of a remanent magnetization involves con¬ 
structing a vector diagram. The method was introduced 
by J. D. A. Zijderveld, a Dutch paleomagnetist, in the 
early 1960s. The magnetization at each stage of demagne¬ 
tization is resolved into north (N), east (E) and vertical 
(V) components. Plots are then made of the north com¬ 
ponent against the east component, and of a horizontal 
component (north or east) against the vertical compo¬ 
nent. This is equivalent to projecting the vector onto the 
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Fig. 5.57 The vector diagram 
method (Zijderveld, 1967) for 
analyzing progressive AF or 
thermal demagnetization, (a) 
Schematic diagram showing 
how the components of the 
vector remaining at each 
stage of demagnetization are 
projected as points on three 
orthogonal planes (horizontal, 
vertical N-S and vertical E-W). 
(b) Vector diagram for the 
thermal demagnetization of a 
limestone sample. 
Magnetization components 
with non-overlapping spectra 
of thermal blocking 
temperatures show as linear 
segments. 



(b) 


Up 


Scaglia Variegata limestone 
Contessa valley, Gubbio, Italy 



horizontal plane and the north-south (or east-west) ver¬ 
tical plane (Fig. 5.57a). 

Components of NRM that have distinct spectra of 
coercivities or blocking temperatures show as linear seg¬ 
ments on a vector demagnetization diagram. The 
example in Fig. 5.57b shows three distinct linear segments 
on the horizontal and vertical projections. The slopes of 
the straight lines represent NRM components with 
different directions. A component removed below 150 °C 
is directed downward to the north, and so has normal 
polarity. The component is probably a soft overprint 
(perhaps a VRM, see Section 5.3.6.5) acquired in the 
present day or a recent field. A vector removed between 
150 °C and 300 °C may represent a more ancient over¬ 
print; it is directed in a southerly, upward direction and 
thus has a reversed polarity. If a stable vector is left after 
demagnetization of less stable fractions, it is indicated by 
a straight line to the origin of each half of the vector 
diagram. This is the case for the component removed 
from 300 °C to 580 °C. It is interpreted as a stable primary 
component acquired when the field had reversed polarity. 

If more than one magnetization component is present, 
it is possible that the spectra of coercivity or blocking 
temperature of the components may overlap partially. 
During demagnetization of the overlapping components 
the vector diagram exhibits a curved trajectory. If the 
spectra overlap completely, no straight segment can be 
determined. In this case the sample does not have a single 
stable magnetization component. 

5.6.3.4 Statistical analysis of paleomagnetic directions 

For the purposes of statistical analysis each paleomag¬ 
netic direction in a collection of samples is considered to 
have equal value and may be regarded as a unit vector. 


Each vector has unit length but a different direction. The 
end points of the vectors lie on the surface of a unit 
sphere and form a distribution of points. The statistical 
methods for evaluating paleomagnetic directions (or the 
distribution of points on a sphere) were developed in 
1953 by Sir Ronald Fisher. He found that the best esti¬ 
mate of the mean direction of a population of N unit 
vectors is their vector mean, R. To illustrate this point, 
consider five paleomagnetic directions, each represented 
by a unit vector (Fig. 5.58a). Usually the unit vectors are 
not parallel and when added vectorially their resultant 
has length R<N (Fig. 5.58b); its direction is the best esti¬ 
mate of the mean of the five paleomagnetic directions. 

Fisher proposed that the probability density P(6, k ) of 
the angle 0 between an individual sample direction and 
the mean direction of the distribution is: 

P(6 ' k) = 4TTsinhK ex P( KCOS0 ) (5-60) 

The parameter k is called the '‘precision parameter” or 
‘‘concentration parameter.” It describes the dispersion of 
the directions, and is akin to the inverse of the variance of 
the distribution. Strictly speaking, k is a property of an infi¬ 
nitely large population of directions. However, in paleo¬ 
magnetic investigations only a small number of directions 
are usually sampled; it is assumed that they are representa¬ 
tive for an infinite population. The parameters that are 
computed are approximate estimates of the true parame¬ 
ters of the population. Fisher showed that the best estimate 
(k) of the precision parameter k (valid for k> 3) is given by 


where R is the vector sum of the N unit vectors, computed 
as in Fig. 5.58b. When k (or k) is zero, the directions are 
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Fig. 5.58 (a) Representation of five magnetization directions as unit 
vectors, (b) The vector mean direction is that of the resultant vector R. 

(c) Stereograms of some distributions of paleomagnetic directions: the 
tighter the grouping, the larger the concentration parameter k. 

uniformly or randomly distributed. A badly scattered set 
of directions has a small value of k; large values of k apply 
to tightly grouped directions (Fig. 5.58c). 

As in other statistical situations, the scatter is a prop¬ 
erty of the distribution of directions. It is described by 
the angular standard deviation, which is proportional to 
\t^k. However, it is usually more important to describe 
how well the mean direction is defined. Keep in mind that 
we do not know the true mean direction; we have only 
made an estimate of it, based on the available data. The 
true mean may differ by several degrees from our esti¬ 
mate. However, if we know that with 95% certainty the 
true mean lies within, say, 7° of our estimate, we can draw 
a cone with semi-angle of 7° about the estimated mean 
direction. The cone is said to define the confidence limits 
of the mean at the 95% probability level. The size of the 
confidence limit depends on the number of directions N 
in the distribution and their dispersion parameter k. The 
semi-angle of the cone of confidence is denoted a 95 and is 
given approximately by 

_ 140 

“ 95 VM 


(a) 



Fig. 5.59 (a) Magnetization directions (arrows) around a fold in stable 
(A) and unstable layers (B), and in stably magnetized conglomerate 
cobbles (C). (b) Comparison of directions in the stable layer (A) and 
unstable layer (B) before and after unfolding. 

We could select any level of confidence to describe how 
well the mean is defined. However, two levels are common 
in statistics: the 95% (significant) and the 99% (highly sig¬ 
nificant) levels. In paleomagnetism the level of 95% confi¬ 
dence is used. This means that there is a 95% probability 
that the true mean of the distribution lies within this cone 
about the estimated mean direction. 

5.63.5 Field tests of magnetization stability 

If possible, paleomagnetic sampling includes a field test 
that can establish the stability of the magnetization over 
geological time. This was especially important in the early 
days of paleomagnetism when the laboratory techniques 
of “magnetic cleaning” were not yet available. Pioneering 
researchers devised some ingenious tests of paleomag¬ 
netic stability based on field observations. The fold test 
and reversals test still serve as the best ways to demon¬ 
strate the stability of a remanent magnetization through 
the aeons of geological time and to verify the timing of its 
acquisition. 

The fold test is perhaps the most important paleomag¬ 
netic field test. It is applied to samples taken from beds that 
were originally horizontal and have been tilted by later tec¬ 
tonic effects. If the paleomagnetic direction in the rock is 
stable, it will experience the same rigid-body rotation as the 
tilted strata; its direction will vary around the fold (Fig. 
5.59a, layer A). This is called a pre-folding magnetization. 
On the other hand, if the magnetization was acquired by 
the rock after it was folded, it will have a uniform direction 
at all points of the fold (Fig. 5.59a, layer B). This is called a 


(5.62) 
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(a) before bedding corrections 




Fig. 5.60 Example of a positive fold test in 12 sites of the Scaglia Rossa 
limestone from southern Umbria. The directions (a) before correcting 
for local bedding tilt are more scattered than (b) the corrected 
directions. 

post-folding magnetization. A third situation is common, 
in which the magnetization is acquired during the tectonic 
event; in this case the direction of magnetization changes 
around the fold but by a smaller amount than the folding. 
This is called a synfolding magnetization. 

In practice, the fold test consists of comparing the 
directions before applying tilt corrections with the direc¬ 
tions after unfolding the tilted beds. If samples with a 
stable magnetization are taken from all parts of the fold, 
their uncorrected directions should be smeared out. After 
correction for bedding tilt, the dispersion of directions 
should be reduced, and the directions should group 
around a common direction, which is the pre-folding 
direction of the formation (Fig. 5.59b, layer A). This is 
called a positive fold test. If the magnetization is unstable 
or is due to post-folding remagnetization, the tilt correc¬ 
tions will increase the scatter of the distribution of direc¬ 
tions (Fig. 5.59b, layer B); this is a negative fold test. 

An application of the fold test is shown in Fig. 5.60 for 
12 sites of the Scaglia Rossa limestone from the central 
Apennines in southern Umbria (Italy). The sites were col¬ 
lected on different limbs of long anticlinal structures. 
Mean directions were computed for about 10-12 “mag¬ 
netically cleaned” samples at each site. The uncorrected 
directions are quite scattered, with a confidence cone (a 95 ) 
equal to 11°. After correcting each site for the local tilt of 
the bedding the data are much better grouped, and the 
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Fig. 5.61 The baked contact test, (a) Magnetization directions of 
intrusion and country rock when both are stable and (b) when one is 
unstable, (c) Example of a stable baked contact test for the Abitibi dikes, 
Ontario, Canada: the directions in the dikes are the same as in the 
baked country rock and are different from the present-day field 
direction (after Irving and Naldrett, 1977). 

confidence cone is reduced to 6°. The concentration para¬ 
meter increases significantly, from 14.6 to 46.3, indicating 
that the magnetization was acquired before the folding. 

The conglomerate test is a field test of stability that is 
rather seldom used. Suppose that we are investigating a 
limestone formation and that we discover a conglomerate 
containing cobbles of the limestone (Fig. 5.59a, layer C). 
Assuming that the cobbles have been randomly re-oriented 
by the processes of erosion, transport and re-deposition, 
their paleomagnetic directions, if stable, should be ran¬ 
domly distributed. If a systematic direction is found, the 
magnetization of the limestone may have a large secondary 
component. 

The baked contact test is important in igneous rocks. 
During intrusion of a dike or sill the adjacent layers of 
the host rock are baked by contact with the hot lava and 
acquire a TRM when they cool. In general the magnetic 
minerals in the lava will differ in composition and grain 
size from those in the host rock. If samples taken from the 
lava and contact zone of the host rock have the same mag¬ 
netization direction (Fig. 5.61a), the lava carries a stable 
paleomagnetic vector. If they are different (Fig. 5.61b), 
one of the magnetizations is unstable; alternatively, either 
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Fig. 5.62 The reversals test, (a) Illustration of how the presence of a 
secondary component S can spoil the anti-parallel directions of normal 
and reverse magnetizations, (b) A positive fold test for samples of the 
Maiolica limestone in a site at Apiro (Marches, Italy): the normal and 
reverse mean directions are almost exactly opposite, (c) Mean directions 
of normal and reverse polarity Maiolica sites in southern Umbria are not 
exactly opposite due to local tectonic rotations about vertical axes. 


component, represented by the vector S, will give 
resultant normal and reversed directions that are no 
longer antiparallel. If it is possible to clean the directions 
magnetically, the antipodal normal and reversed direc¬ 
tions (N and R) should be recovered. If magnetic cleaning 
is inadequate, a residual part of the unremoved secondary 
component may spoil the antiparallelism. 

An example in which the reversals test shows success¬ 
ful “cleaning” is shown in Fig. 5.62b for samples from a 
single site in the Early Cretaceous Maiolica limestone in 
central Italy. The vector mean of 14 normal samples has 
Z) N = 298°, / N = 34°, and a 95 = 3°; the mean of 10 
reversed polarity samples has D R = 115°, / R = - 32°, and 
a 95 = 5°. A simple way to compare how well the sets of 
normal and reversed polarity directions agree is to invert 
the mean direction and confidence circle for the reversed 
group of samples through the origin. The common polar¬ 
ity mean directions differ by only 3°. The mean of each 
group lies within the confidence limits of the other, so 
there is no significant difference between the normal and 
reversed directions. 

When the site-mean directions from several sites of the 
Maiolica limestone are compared throughout a large 
region of the Umbrian Apennine mountain belt, the 
antiparallelism of sites with normal and reversed polarities 
no longer holds. The vector mean of 10 normal sites has 
Z> N = 313°, / N = 38°, and a 95 = 8°; the mean of six reversed 
polarity sites has D R = 144°, / R = -33°, and a 95 = 7° (Fig. 
5.62c). In this case the common polarity mean directions 
differ by 10°. The mean of each group lies outside the confi¬ 
dence limits of the other, so there is now a significant 
difference between the normal and reversed directions. 
Closer examination shows that the mean inclinations I of 
the normal and reversed groups are equivalent, but the dec¬ 
linations D are dispersed along a small circle about a verti¬ 
cal pole. The smeared declinations reflect small rotations 
of each site about a vertical axis, the result of regional tec- 
tonism in the area of investigation. This illustrates an 
important application of paleomagnetism: the description 
of tectonic rotations that would otherwise be difficult or 
impossible to observe in the field. 


the lava or the baked zone was re-magnetized at a later 
time. An example of stable magnetizations in Precambrian 
rocks from the Canadian shield is shown by the agreement 
between the directions in the Abitibi diabase dikes and the 
heated contact zone (Fig. 5.61c). 

The reversals test can be applied when the paleomag- 
netic samples represent a large enough time interval (>10 
ka) to have recorded normal and reversed polarities of 
the magnetic field. Remanent magnetizations acquired 
within successive intervals of constant polarity of the 
Earth’s magnetic field should be exactly antiparallel. Let 
the normal magnetization be represented by the vector 
N and the reversed magnetization by the vector R 
(Fig. 5.62a). The presence of an unremoved secondary 


5.6.4 Paleomagnetism and tectonics 

Paleomagnetism has made important contributions in 
documenting local and regional tectonic motions as well 
as the motions of lithospheric plates. The reason for the 
failure of the reversals test in Fig. 5.62c was ascribed to 
local tectonic disturbances within a region. To make use 
of paleomagnetic data on a larger scale the observed 
directions must be compared to suitable reference direc¬ 
tions. A reference direction can be computed, if it is 
known where the paleomagnetic pole was in the geologi¬ 
cal past. The history of paleomagnetic pole positions can 
be established on a continental scale. 

Paleomagnetic results from central and southern 
Europe document the effects of large-scale tectonics 
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Fig. 5.63 Declinations of Permo-Triassic and Cretaceous rocks from 
Italy differ systematically from those of Europe north and west of the 
Alps, but agree well with directions predicted for the African plate. A 
hypothetical outline of the Adriatic promontory to the African plate is 
suggested by the shaded line. 

(Fig. 5.63). Data of Late Paleozoic to Late Cretaceous 
age are represented in this analysis. During this long time 
interval large amounts of motion of the European and 
African plates have taken place. As a result the inclina¬ 
tions measured at the indicated sites show large varia¬ 
tions. However, the reference declinations for sites in 
Europe do not vary much during this time; this is also 
true for the African reference declinations. There is a large 
difference between the north-northeast pointing Euro¬ 
pean declinations and the northwest directed African 
declinations (Fig. 5.63). The paleomagnetic declinations 
observed at sites in central and southern Europe show 
distinct affinities. North and west of a crude line through 
the Alpine chain the paleomagnetic declinations agree 
with those expected for the European continent. The dec¬ 
linations observed south of the Alps, on the Italian penin¬ 
sula and Sicily, are oriented toward the northwest, in 
agreement with directions expected for the African conti¬ 
nent. The pattern of paleomagnetic data supports a tec¬ 
tonic interpretation of the Italian peninsula and adjacent 
regions of the Adriatic as a northern promontory of the 
African plate. Although the differences in the declination 
pattern are striking, there is a certain amount of leeway in 
the interpretation. This is mainly because the reference 
directions of Africa are derived from paleomagnetic pole 
locations that are not very well defined for some time 
intervals. 


geographic 



Fig. 5.64 Method of locating the virtual geomagnetic pole (VGP) from 
the declination D and inclination / measured at a site (after Nagata, 1961). 


5.6.4.1 Location of the virtual geomagnetic pole 

Paleomagnetic poles are computed as the average of 
virtual geomagnetic pole (VGP) positions calculated for a 
number of samples at a site. The VGP position is where 
the pole of a geocentric magnetic dipole would need to be 
in order to give the observed declination D and inclina¬ 
tion / of the remanent magnetization measured in the 
sample. The method of computation of the VGP position 
is illustrated in Fig. 5.64. First, from the inclination of the 
magnetization (i.e., the paleofield) we can calculate how 
far away the VGP was at the time the rock magnetization 
was acquired. The angular distance to the pole p, assum¬ 
ing a dipole magnetic field, is obtained by using the rela¬ 
tionship between inclination and polar angle (Fig. 5.54a). 

The value of p determines the radius of a small circle 
centered on the paleomagnetic sampling site at latitude A s 
and longitude <£ s . The circle is the locus of all possible 
VGP positions that could give the observed inclination I 
at the site. We next have to decide which point on the 
small circle is the VGP position. The declination of the 
remanent magnetization is the angle between geographic 
north and the horizontal direction to the ancient mag¬ 
netic pole. In this case the declination defines a meridian 
(or great circle) which passes through the sampling site 
and makes an angle D with the north-south meridian 
(Fig. 5.64). The place where this great circle intersects the 
small circle with radius p is the location of the virtual geo¬ 
magnetic pole. Its latitude (A p ) and longitude (</> p ) can be 
computed exactly from trigonometric formulas (Box 5.6): 

sinA p = sin A s cos/; + cos A s sin p cos D 
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Box 5.6: Virtual geomagnetic pole (VGP) location 


The sine and cosine relationships between the sides and 
angles of spherical triangles (Box 1.4) may be applied to 
the spherical triangle in Fig. 5.64, its corners corre¬ 
sponding to the site, pole position (VGP) and geo¬ 
graphic pole to determine the unknown latitude and 
longitude of the virtual paleomagnetic pole. The length 
of the side between the site and the geographic pole, 
measured in degrees of arc, is (90-A s ), where A s is the site 
latitude. The length of the side between the VGP posi¬ 
tion and the geographic pole is (90-A p ), where A p is the 
VGP latitude. The length of the third side of the triangle 
is p. The direction to the VGP position from the site is 
the declination D, which is opposite the side of length 
(90-A ). These values are substituted in the law of 
cosines (Box 1.4, Eq. (4)) as follows: 

a=(90-A p ) cosa = sinA p 

/>=(90-A s ) cosfe = sinA s sin6 = cosA s (1) 
c=p A=D 

sinA p = sinA s cos/> + cosA s sinpcosZ) (2) 

The great circles through the site and the VGP loca¬ 
tion meet at the geographic pole where they form an 
angle, for which there are two possibilities: the acute 
angle /3, or its obtuse equivalent (180 - /3). As a result, 
the solution for the longitude 4> p of the paleomagnetic 
pole involves two steps. 

First, applying the law of sines (Box 1.4, Eq. (3)) to 
the spherical triangle gives 


sin/3 = sinD 

sinp sin(90-A p ) ' ' 

from which the magnitude of the angle /3 may be 
obtained 

sinpsinD 

s11 ^ = ^a7 W 

Next, in order to decide whether the solution 
requires (3 or (180 - /3), the law of cosines is again 
applied to the spherical triangle, this time for side p, 
with the following substitutions: 

a= p A=/3 

b=( 90 — A s ) cos/> = sinA s sin/? = cosA s (5) 

c=(90 —A p ) cosc = sinA p sinc = cosA p 

These give 

cos p = sinA s sinA p + cosA s cosA p cos/3 (6) 

(cosp — sinA s sinA p ) = cosA s cosA p cos/3 (7) 

On the right-hand side of Eq. (7), cosA. and cosA„ are 
always positive, so cos/3 must take the sign of the left 
hand side of the equation. This gives the two possibilities 
for the value of (</> p - </> s ), namely, /3 or (180 - /3). Thus, 

(</>-(/>.) =/3 for (cos/7 - sinA sinA ) > 0 

( 8 ) 

(</> p — (/> s ) = 180 - (3 for (cos/7 - smA s sinA p ) <0 


4> p = 4> s + (3 , for cos/7 > sinA s sinA p 
(/> p = </> s + 180 — /3, for cos/7<sinA s sinA p 


(5.63) 


where 


sin/3 


sinpsinZ) 

cosA p 


(5.64) 


The longitude of the paleomagnetic pole is here given 
relative to a fixed meridian in present-day geographic coor¬ 
dinates. The key paleomagnetic parameter is the distance p 
of the investigated site from the Earth’s rotation axis, 
which was the position of the paleomagnetic pole at the 
time of formation of the rocks under investigation. At the 
time of magnetization all locations on the same latitude 
(i.e., at the same distance p from the rotation axis) were 
magnetized with zero declination, because the axial dipole 
field lines through the site lead to the rotation axis. The 
longitude of the site (its position on the circle of latitude) 
remains indeterminate. The declination measured later at 
the site is the expression of any change of azimuthal orien¬ 
tation, which can result, for example, from local tectonic 
motion or from large-scale continental displacement. 


5.64.2 Apparent polar wander paths 

The observation that paleomagnetic poles obtained from 
rocks of Pleistocene and Pliocene age are closely grouped 
about the geographic pole (see Fig. 5.55) is in agreement 
with the axial geocentric dipole hypothesis. However, when 
paleomagnetic pole positions are calculated for old rocks 
from the same continent, they group far away from the geo¬ 
graphic pole. This is illustrated by the positions of paleo¬ 
magnetic poles from the stable European craton. Pliocene 
and Pleistocene poles group close to the geographic pole 
but Permian poles are located about 45° away (Fig. 5.65). If 
the axial dipole hypothesis is valid for rocks of all ages, the 
pole distributions imply that the geographic pole for 
Europe in the Permian period (about 250-290 Ma ago) lay 
far from its present position. An alternative interpretation 
is that the geographic pole has not changed, but the 
European continent has moved relative to the pole. This 
suggests that the position about which the Permian poles 
now cluster was on the rotation axis in the Permian period. 
The European continent has subsequently moved to its 
present-day position with regard to the rotation axis. 
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European paleomagnetic poles : 

<H Pliocene and Pleistocene 

• Permian 

Fig. 5.65 Locations of European paleomagnetic poles. Pliocene and 
Pleistocene poles (data source: McElhinny, 1973) lie close to the 
present-day geographic pole, while Permian poles (data source: Van der 
Voo, 1993) are located at about 45°N in the northwest Pacific Ocean. 

Paleomagnetic data allow us to resolve the ambiguity. 
If paleomagnetic pole positions are computed for rocks 
of different ages from the same continent, they plot sys¬ 
tematically along an irregular, curved path. It appears as 
though the paleomagnetic pole has moved slowly along 
this path towards the present rotation axis. The apparent 
motion of the paleomagnetic pole is called apparent polar 
wander (APW) and the path is called an apparent polar 
wander path. The paleomagnetic data from a particular 
continent define a unique APW path for that continent, 
and each continent has a different APW path. Thus, we 
have a European APW path, African APW path. North 
American APW path, and so on. 

A schematic plot of the European and North 
American APW paths since the Late Paleozoic shows 
clearly distinct curves (Fig. 5.66). Each APW path lies on 
the opposite side of the geographic pole from the conti¬ 
nent to which it belongs. Keeping the axial geocentric 
dipole hypothesis in mind, it is obviously impossible that 
the paleomagnetic pole (i.e., the Earth's rotation axis) 
could have moved simultaneously along two different 
APW paths. The two APW paths evidently represent the 
separate motions of the European and North American 
continents relative to the rotation axis. They constitute 
paleomagnetic evidence for “continental drift.” 

5.6.43 Paleogeographic reconstructions using APW paths 

A more detailed plot of the two APW paths for the time 
before the Late Jurassic (Fig. 5.67a) shows strong similar¬ 
ities in their shapes, particularly for the time from the 


90°E 



Fig. 5.66 Average apparent polar wander paths for North America and 
Europe in the past 350 Ma (after Irving, 1977). Numbers on paths are 
age in Ma. 


Upper Carboniferous to the Upper Triassic. It is possible 
to overlay these two segments of the APW paths by 
moving Europe (including Russia west of the Ural moun¬ 
tains) and North America into different positions relative 
to each other (Fig. 5.67b). For the time represented by the 
overlap of the APW paths the two continents formed part 
of a larger “supercontinent” called Euramerica. When 
the adjacent part of Asia east of the Urals is included, the 
continents in the northern hemisphere form an earlier 
landmass called Laurasia. The present separation of the 
APW paths (Figs. 5.66, 5.67a) is interpreted as evidence 
for relative plate tectonic motion between Europe and 
North America that has taken place since the end of the 
interval for which the APW paths overlap well, i.e., since 
the Early Jurassic. 

In order to bring the two APW paths into coincidence 
we have to move Europe relative to North America (or 
vice versa) so as to close the present gap between the con¬ 
tinents. As shown in Section 1.2.9, the relative motion of 
plates on the surface of the spherical Earth is equivalent 
to a relative rotation about an Euler pole of rotation. The 
computer-generated “Bullard” fit of the 500 fathom 
contour lines on opposites sides of the North Atlantic 
ocean (see Section 1.2.2.2) can be obtained by displacing 
Europe toward North America by a clockwise rotation 
through 38° about the Euler pole located at 88.5°N 
27.7°E, which by coincidence is very close to the present- 
day geographical pole (Fig. 5.67a). The APW path of a 
continent is constrained to move with the continent. If 
the European APW path is also rotated by 38° clockwise 
about the same Euler pole, the observed overlap of the 
Upper Carboniferous to Upper Triassic sections of the 
European and North American APW paths is obtained. 
Later segments of the paths diverge, indicating relative 
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Fig. 5.67 (a) The Ordovician 
to Jurassic segments of the 
North American and 
European APW paths, (b) The 
same APW paths after 
rotating Europe by 38° 
clockwise about the Euler 
rotation pole at 88.5°N 
27.7°E, marked by the square 
symbol in (a) (after Van der 
Voo, 1990). 



motion between the continents. The Late Jurassic pole 
position corresponds to the Earth's rotation axis at that 
time. Circles of paleolatitude about the North American 
paleomagnetic pole emphasize the paleogeographic 
reconstruction of the relative positions of Europe and 
America in the Late Jurassic. 

The interpretation of APW paths is not always as 
clear-cut as in this example. Consider the situation of the 
paleomagnetic pole P for continental plate C, which is 
rotated through an angle 11 about the Euler pole of rota¬ 
tion, E. First, let the plate lie between the paleomagnetic 
pole and the rotation pole (Fig. 5.68a). The rotation of 
the plate from C to C' causes the paleomagnetic pole P to 
move to P'. The arc PP' of the polar motion is longer 
than the arc CC' of the true plate motion. Next, consider 
what happens when the paleomagnetic pole lies between 
the plate and the Euler pole (Fig. 5.68b). In this case the 
plate moves through a large distance but the paleomag¬ 
netic pole moves only a small distance. In the extreme 
case where the paleomagnetic and Euler poles coincide, 
the plate rotation does not move the paleomagnetic pole 
at all. Under these special conditions plate motion leaves 
no trace in the APW path of the plate. 

Clearly, the interpretation of an APW path as a record 
of plate motion relative to the geographic axis must be 
made with caution. The rate of motion of the pole along 
an APW path cannot be simply equated with the rate of 
motion of the parent continent or global plate. It follows 
that similarity of APW paths does not imply a unique 
solution for former relative plate positions. However, if 
two continents once belonged to the same plate for some 
length of time, they should have acquired the same APW 
path for this time. Matching the present APW paths of 
the continents for the time they were on the same plate 
should give a unique reconstruction of the earlier posi¬ 
tions of the continents relative to each other. To avoid 



(b) 


Fig. 5.68 Rotation of a continental plate C about an Euler pole E 
displaces the paleomagnetic pole P (a) by a large amount, if P is further 
from E than the continent C, and (b) by only a small amount when P lies 
close to E. 
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Fig. 5.69 The configurations 
of Pangaea models A1, A2 
and B based on the matching 
of coastlines and the 
optimizing of paleomagnetic 
data (after Morel and Irving, 
1981). 



ambiguities, additional independent evidence (such as 
paleoclimatic data, or computer matching of coastlines) 
must be utilized in conjunction with paleomagnetic data 
for making such reconstructions. 

5.6.44 Paleomagnetism and continental drift 

The nineteenth century geologist Eduard Suess deduced 
the existence of a great Late Paleozoic continent, which he 
called Gondwanaland (Section 1.2.1). It was composed of 
Africa, Antarctica, Arabia, Australia, India and South 
America. In 1912, Wegener went a step further by postu¬ 
lating that all the present continents lay close together 
during the Late Paleozoic, forming a single great conti¬ 
nent that he called Pangaea. Wegener’s concept was based 
on paleoclimatic evidence, the matching of Carboniferous 
coal belts and of regions of Paleozoic glaciation in the 
different continents. Subsequently, additional geological 
evidence for the existence of Gondwanaland and Pangaea 
during the Late Paleozoic and Early Mesozoic accumu¬ 
lated from the fields of sedimentology, paleontology and 
tectonics. The earlier great continents were presumed to 
have dispersed to their present-day location by the process 
of continental drift. Unfortunately, Wegener was unable to 
offer a satisfactory driving mechanism for continental 
drift, and some of his ideas were found to be extreme. 
Scepticism among geophysicists and geologists brought 
Wegener’s theories into disrepute. 

Interest in continental drift was re-awakened in the 
1950s by the development of paleomagnetism. Soon there¬ 
after some of the most convincing paleomagnetic evidence 
for continental drift was obtained from the “southern con¬ 
tinents.” Researchers found that Mesozoic paleomagnetic 
pole positions of the same age from these continents were 
very dissimilar. In landmark contributions E. Irving 
showed that the paleomagnetic poles of the southern con¬ 
tinents were incompatible with the present-day arrange¬ 
ment of these continents, but that they agreed much better 
when the continents were rearranged to conform with a 
Gondwanaland reconstruction. Numerous paleomagnetic 
investigations have subsequently provided a rich database 
that can be used to test the validity of reconstructions of 
former great continents at different times in their history. 


The reconstructions are generally not made on paleo¬ 
magnetic evidence alone. Usually, a model is proposed, 
based on geometrical or geological grounds. The congruity 
of paleomagnetic pole positions from the separate 
continents is then evaluated in their reconstructed posi¬ 
tions. The model is adjusted iteratively until a configura¬ 
tion of the continents is obtained that gives minimum 
dispersion of the paleomagnetic poles. 

The evaluation of paleomagnetic data from the 
Gondwanic continents, North America and Europe lends 
convincing support to the reconstructions and thereby to 
the continental drift hypothesis. From the Carboniferous 
to the Triassic, contemporary paleomagnetic poles from 
the individual continents do not agree when the conti¬ 
nents are in their present positions, but are more consis¬ 
tent when the great continent is reconstructed. In fact, the 
paleomagnetic data are of high enough quality to suggest 
refinements to the purely geometric reconstructions. The 
Pangaea model in Fig. 5.69a corresponds closely to com¬ 
puter-assisted matches of the continental coastlines (see 
Section 1.2.2.2); it is referred to as Pangaea Al. It places 
the east coast of North America adjacent to the coast of 
northwest Africa. This configuration is supported well by 
Late Triassic and Early Jurassic paleomagnetic poles. It is 
the generally accepted model of Pangaea immediately 
prior to its breakup in the Early Jurassic. However, older 
paleomagnetic data of Permian and Carboniferous age 
(around 280 Ma) are less compatible with the Pangaea Al 
model. Results of Late Permian to Middle Triassic age 
agree much better with a configuration referred to as 
Pangaea A2 (Fig. 5.69b), first proposed by R. Van der 
Voo and R. French in 1974. In this model, North America 
is much closer to South America and its eastern coast is 
opposite to western Africa. The transition from the Late 
Permian Pangaea A2 to the Late Triassic Pangaea Al 
configuration requires a large dextral shear between the 
continents in this time interval. 

Pangaea may have had yet another configuration 
earlier in its history. In 1977 E. Irving showed that results 
of Carboniferous and Early Permian age agree better for 
a Pangaea configuration in which the east coast of North 
America is adjacent to the west coast of South America 
(Fig. 5.69c). This model, Pangaea B, is possible because 
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paleomagnetic longitudes are much more poorly con¬ 
strained than paleolatitudes. The change from Early 
Permian Pangaea B to Late Permian Pangaea A2 
requires a huge dextral megashear between Laurasia and 
Gondwana. 

The models Pangaea Al, A2 and B are each consistent 
with paleomagnetic data for the different times of the 
reconstructions, which span about 100 Ma. None of the 
models accounts for the apparent polar wander paths of 
the individual continents over the whole interval of time 
from Early Permian to Late Triassic or Early Jurassic. 
Instead, the differences between the models imply that 
Pangaea was not a static great continent for this entire time 
interval, but that internal motions took place between the 
constituent continents. 

Paleomagnetic data can be used to reconstruct the rela¬ 
tive positions of continents during any time interval with 
enough good paleomagnetic data. APW paths (e.g., Fig. 
5.66) can be determined fairly precisely by averaging the 
best available pole positions in 20-40 Ma time windows. 
Optimum fitting of APW paths of different continents 
allows reconstructions to be made for the time repre¬ 
sented by the matching segments (Fig. 5.67). When this 
procedure is applied to paleomagnetic data covering the 
last 375 Ma, a picture of continental drift since the 
Middle Devonian is obtained (Section 1.2.2.3). Accord¬ 
ing to this scenario, the supercontinents Laurasia and 
Gondwana, which were still separated by the Hercynian 
Ocean (Fig. 5.70) in the Devonian, collided in the 
Carboniferous to form Pangaea. The paleomagnetic 
reconstruction of continental drift for older epochs 
becomes tenuous because reliable paleomagnetic data 
become scarcer. The derivation of durable reconstructions 
for the Early Paleozoic and Precambrian will be a long 
and painstaking process. 

The positions of continents since the breakup of 
Pangaea can also be obtained from analysis of APW 
paths. However, more precise reconstructions can be 
made by using a different form of paleomagnetic data, 
namely the record of geomagnetic polarity. Sea-floor 
spreading has imprinted this record in the oceanic crust, 
creating lineated magnetic anomalies. Matching coeval 
anomalies allows us to trace the motions of the lithos¬ 
pheric plates since the Middle Jurassic and describe the 
drift of the continents which they transport. 

5.7 GEOMAGNETIC POLARITY 

5.7.1 Introduction 

The earliest demonstration that the geomagnetic field has 
changed polarity in the past was made by the French sci¬ 
entists P. David and B. Brunhes. In 1904-6 they described 
the magnetic properties of young lava flows in the Massif 
Central region of France. They found that clays baked by 
the lava flows had the same direction of remanent magne¬ 
tization as the lavas. Moreover, when the magnetization 


(Middle Devonian 375 M a) 



Fig. 5.70 Continental drift since the Devonian is illustrated by 
reconstructions of the positions of the major continental blocks at 
different times based on paleomagnetic data (after Irving, 1977). 


direction in the lava was opposite to that of the present- 
day field, the same was the case in the baked clay. The 
opposite polarities were interpreted as evidence that the 
geomagnetic field can reverse its polarity. 

A Japanese scientist, M. Matuyama, was the first to 
associate the polarity of remanent magnetization in 
lavas with their age, determined stratigraphically. In 
1929 he reported finding young Quaternary lavas with 
magnetization directions close to the present-day field 
direction, whereas the directions in older Quaternary 
and Pleistocene lavas were clustered about an antipodal 
direction. He also found that one of three samples of 
Miocene basalt was magnetized oppositely to the other 
two. Matuyama’s interpretation was that geomagnetic 
polarity had changed several times during Late Tertiary 
time. 

The idea that geomagnetic polarity could change was 
controversial, and for many years sceptics sought alterna¬ 
tive interpretations. Scientists realized that the observed 
reversed polarities might have a mineralogical explanation. 
Indeed, some ferromagnetic minerals, because of their 
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composition and structure, can acquire a thermoremanent 
magnetization exactly opposite to the field direction. This 
mechanism is called self-reversal of magnetization. It has 
been described in lavas in which the ferromagnetic miner¬ 
als are particular forms of titanohematite. Fortunately, it is 
a rather rare phenomenon. Most records of polarity rever¬ 
sals have been found to be a feature of the geomagnetic 
field. 

To envisage what a reversal of geomagnetic polarity 
means, imagine that the geomagnetic dipole inverts its 
direction. At present the axial geocentric dipole points 
from the northern hemisphere towards the southern 
hemisphere; a polarity reversal would orient the dipole in 
the opposite direction. At each point on the surface the 
magnetic inclination / changes sign and the declination D 
changes by 180°; for example, a normal direction {/= 40°, 
D = 30°} might change to a reverse direction {/= —40°, D 
= 210°}. A polarity reversal is a global event, experienced 
simultaneously all over the Earth. Thus, geomagnetic 
reversals provide a convenient means of stratigraphic cor¬ 
relation and dating. 

5.7.1.1 Geomagnetic polarity transitions 


Steens Mountain (Oregon), Miocene polarity reversal 




Fig. 5.71 The record of a reversed-to-normal Miocene polarity 
transition at Steens Mountain, Oregon, (a) The paleointensity record 
during the transition and (b) the directional record, shown as the 
angular deviation from the normal paleomagnetic direction outside the 
transition (after Prevot ef at, 198S). 


The change of polarity from one sense to the opposite 
one is called a polarity transition. Paleomagnetic records 
of polarity transitions have been observed in radiometri- 
cally dated lava sequences and in deep-sea sediments with 
known deposition rates. These records indicate that the 
duration of a polarity transition is about 3.5-5 ka. This is 
much shorter than the length of the interval of constant 
polarity before or after the transition, which may last for 
hundreds of thousands or millions of years. 

It is not yet known for sure how the geomagnetic field 
behaves during a polarity transition. The dipole field is 
dominant before and after a transition, but it is not certain 
that this is the case during the transition. Detailed analy¬ 
ses of field behavior during a polarity transition usually 
show a notable decrease in field intensity (Fig. 5.71); this is 
observed in volcanic and sedimentary records of reversals. 
Possibly the dipole component disappears, granting more 
importance to higher-order quadrupole or octupole field 
configurations. There seems to be stronger evidence that, 
even though its intensity decreases, the transitional field is 
still dominantly that of a dipole. If this can be assumed, 
the position of the virtual geomagnetic pole (VGP) of the 
transitional dipole field can be calculated. During a polar¬ 
ity transition the VGP position changes progressively. It 
appears to move systematically relative to the Earth’s rota¬ 
tion axis, defining a path from one polar region to the 
opposite one. The transitional paths of many reversals 
appear to define two longitudinal belts, one over the 
Americas and an antipodal belt over Southeast Asia. 
However, many other transitions do not pass over these 
two belts. It has not been established conclusively that a 
path over the Americas or Southeast Asia is a preferred 
feature of polarity transitions. 


Polarity 

interpretation 
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Fig. 5.72 Definition of polarity chrons, subchrons and transitions 
(modified after Cox, 1982, and Harland ef a/., 1990). 


5.7. 1.2 Geomagnetic polarity intervals 

Long intervals of constant normal or reversed polarity, 
originally called polarity epochs , are referred to as polar¬ 
ity chrons (Fig. 5.72); they last typically from 50 ka to 
5Ma. The polarity chrons are interrupted at irregular 
intervals by shorter polarity subchrons (originally called 
events) lasting for 20-50 ka. At times the polarity record 
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Fig. 5.73 Progressive evolution and refinement of the magnetic polarity 
timescale. 


shows large departures of the magnetic pole from 
normal or reversed polarity, but the polarity does not 
change completely; the pole wanders into equatorial lat¬ 
itudes but returns to its initial location on the rotation 
axis. The departure is short lived, lasting less than 10 ka, 
and the phenomenon is called a magnetic excursion. The 
irregular pattern of polarity intervals in any sequence 
provides a kind of geological fingerprint which can be 
used under favorable circumstances to date and corre¬ 
late some types of sedimentary rocks. This procedure 
is called magnetic polarity stratigraphy, or magne¬ 
tostratigraphy. 


5.7.2 Magnetostratigraphy in lavas and sediments 

In the 1950s the methods of dating rocks took a giant 
step forward with the development of improved tech¬ 
niques for dating rocks radiometrically. The potas¬ 
sium-argon method (Section 4.1.4.4) was applied to the 
determination of accurate ages for Pliocene and Pleisto¬ 
cene lava samples from flows that were also sampled for 
paleomagnetic purposes. The polarity of the thermore¬ 
manent magnetization of the lava was found to correlate 
with its age. There were distinct intervals of time in which 
the field polarity was the same as at present, and these 
were separated by intervals of exactly opposite polarity. 
At first the data were sparse, and in the earliest interpre¬ 
tations it was thought that the field changed polarity 
quite regularly, roughly once every million years (Fig. 
5.73). Gradually, however, a more complex history 
evolved. Long epochs of a given polarity were found to 
contain much shorter events of opposite polarity. The 
polarity epochs were named after important investiga¬ 
tors of paleomagnetic polarity (Brunhes, Matuyama) 
and geomagnetism (Gauss, Gilbert) while the polarity 
events were named after the geographical location where 
they were first discovered (Jaramillo creek in New 
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Fig. 5.74 Left : a composite radiometric timescale for the past 5 Ma, 
compiled from several sources (Mankinen and Dalrymple, 1979; Spell 
and McDougall, 1992; McDougall efa/., 1992). Right: a polarity 
timescale for the same time interval obtained from marine magnetic 
anomalies, dated by correlation to radiometrically dated tie-points as 
well as by astrochronological ages based on Milankovitch cyclicities 
(Cande and Kent, 1992, 1995). 


Mexico, Olduvai gorge in Africa, etc.). Countless studies 
of magnetic polarity in radiometrically dated lavas have 
established the history of geomagnetic polarity in the last 
5 Ma (Fig. 5.74). If the polarity record in a rock sequence 
can be identified, its age can be determined by compari¬ 
son with the dated sequence. For this reason a dated 
polarity sequence is called a geomagnetic polarity 
timescale. 

There is a practical limit to the application of this 
technique. As can be seen by quick inspection, some of 
the polarity events last less than 50 ka. If a reasonable 
precision of 1-2% is assumed for potassium-argon 
dating, the error in determining the age of a lava sample 
that is about 5 Ma old amounts to 50-100 ka. This is 
longer than the duration of many short events. The 
dating error makes it impossible to associate the lava 
sample unambiguously with the correct polarity event. 
Extension of the magnetic polarity timescale beyond 
5 Ma requires other methods. 

In the middle 1960s the polarity record from lavas was 
augmented by a large amount of high-quality data 
acquired from young deep-sea sediments. The deep ocean 
basins provide a tranquil depositional environment, where 
sediments are deposited at rather uniform rates. Marine 
geologists routinely take cores of sediment with a special 
coring device (see Fig. 4.26) for sedimentological, geophys¬ 
ical and paleontological studies. The magnetostratigraphy 
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Fig. 5.75 Variations in magnetic inclination and inferred polarity with 
depth in two deep-sea sediment cores (based on data from Opdyke ef 
a/., 1968). 


of a core is studied by measuring the direction of magneti¬ 
zation in small oriented samples from different depths in 
the core. Although deep-sea cores have vertical axes, they 
are not oriented in azimuth, so the declinations can only be 
determined relative to an arbitrary reference value. Polarity 
determinations are often based only on the inclination 
records. The boundaries between normal and reverse 
magnetozones are interpolated at the depths where the 
inclination is zero (Fig. 5.75). In equatorial cores, where 
inclinations are nearly zero, the relative changes in declina¬ 
tion often give a good polarity record. The polarity records 
of numerous cores correlate well with the sequence found 
in contemporaneous young lavas. The sediment magnetic 
polarity records are independent of lithology; the same 
reversal occurs at different depths from core to core 
because of different sedimentation rates (Fig. 5.76). By cor¬ 
relating reversals with the radiometrically dated lava 
record, the sediment ages at the reversal depths are 
obtained. From the depths and ages it is a simple matter to 
calculate the incremental sedimentation rates in the core. 
In addition to providing sedimentation rates, magnetic 
polarity stratigraphy also yields the absolute ages of the 
first and last appearances of key fossils, and so gives 
absolute dates for paleontological fossil zones. A recent 
innovation is the use of astrochronology, based on the 
identification of Milankovitch cycles (Section 2.3.4.5), to 
provide refined dating of sediments and their polarity 
record. 

The magnetostratigraphic data from deep-sea sediment 
cores eliminated the lingering doubt that reversals 
observed in lavas may be due to a self-reversal mechanism. 
Lavas and sediments acquire their magnetizations by quite 
different mechanisms; the thermoremanent magnetization 
of a lava is acquired rapidly during cooling from high tem¬ 
perature, whereas the depositional or post-depositional 


remanent magnetization in a sediment is acquired slowly 
at constant ambient temperature. Although a self-reversal 
mechanism might be invoked to cast doubt on polarity 
changes in lavas, the argument is invalid for the remanence 
of deep-sea sediments. The common polarity sequence in 
lavas and sediments can only be explained as a record of 
the alternations of polarity of the Earth’s magnetic field. 
Moreover, the same pattern of reversals is found regard¬ 
less of geographical location, emphasizing that reversals 
are a global phenomenon. 

5.7.3 Marine magnetic anomalies and geomagnetic polarity 
history 

The striped magnetic anomaly patterns formed at oceanic 
ridges contribute to the compelling geophysical evidence 
in favor of the theory of global plate tectonics (see 
Section 1.2.5). Marine magnetic surveys and independent 
investigations of the rock magnetic properties of marine 
rocks and sediments have identified the source of the 
magnetic anomalies to be the basaltic Layer 2A of the 
oceanic crust. 

Seismic evidence indicates that the oceanic crust has a 
vertically stratified structure (see Fig. 3.85). The upper¬ 
most part, seismic Layer 1, consists of a layer of slowly 
accumulating marine sediments; the thickness of the sedi¬ 
ments increases progressively away from the ridge crest. 
The sediments are so weakly magnetic that they are essen¬ 
tially transparent to the Earth's magnetic field. The 
seismic Layer 2A consists of a 500 m thick layer of 
oceanic basalts that are extruded as submarine lava flows 
or intruded as dikes. These basalts are strongly magnetic 
and are chiefly responsible for the strong magnetic anom¬ 
alies observed at the ocean surface. The metamorphosed 
basalts of the underlying Layer 2B are too weakly mag¬ 
netic to have much signature. The rocks of the deeper 
gabbroic Layer 3 may be sufficiently magnetic to add to 
the skewness of the magnetic anomalies. 

5.7.3.1 Marine magnetic anomalies 

The origin of marine magnetic anomalies was explained 
by the Vine-Matthews-Morley hypothesis in 1963 
(Section 1.2.5.1). Oceanic basalts were found to have 
strong and stable remanent magnetizations. Their 
Konigsberger ratios are much larger than unity, so the 
remanent magnetizations are more important than the 
magnetization induced by the present-day geomagnetic 
field. The conventional method of interpreting surveyed 
magnetic anomalies assumed that the anomaly was due 
to the susceptibility contrast between adjacent crustal 
blocks. According to the Vine-Matthews-Morley hypo¬ 
thesis the oceanic magnetic anomalies arise from the con¬ 
trast in remanent magnetizations between adjacent, 
oppositely magnetized crustal blocks. The remanent mag¬ 
netization is acquired thermally by the basalts in oceanic 
crustal Layer 2A. 
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Fig. 5.76 Magnetic reversals 
in Antarctic deep-sea 
sediment cores correlate with 
the radiometric polarity 
timescale. This allows fossil 
zones (Greek letters) to be 
dated. Tie-lines between 
reversals illustrate the effects 
of different sedimentation 
rates (based on data from 
Opdyke etal., 1968). 
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The main magnetic mineral in oceanic basalts is 
titanomagnetite (Section 5.3.2.1). A basaltic lava is ini¬ 
tially at a temperature well above 1000 °C. Its titanomag- 
netite grains frequently have skeletal structures, 
indicating that they cooled and solidified so rapidly that 
there was not enough time for the formation of normal 
crystals. Eventually the temperature of the lava sinks 
below the Curie point of the titanomagnetite (around 
200-300 °C) and the lava acquires a thermoremanent 
magnetization (TRM) in the direction of the Earth's 
magnetic field at that time. Basalts formed contempora¬ 
neously along an active spreading ridge acquire the same 
polarity of magnetization. Long thin strips of similarly 
magnetized crust form on opposite sides of the spreading 
center. These elongated “crustal blocks” may be hun¬ 
dreds of kilometers in length parallel to the ridge axis 
and several tens of kilometers wide normal to the ridge, 
while Layer 2A - the strongly magnetic upper part - is 
only 0.5 km thick. 

Sea-floor spreading persists for millions of years at an 
oceanic ridge. During this time the magnetic held 
changes polarity many times. The alternating held polar¬ 
ity leaves some blocks of oceanic crust normally magne¬ 
tized while their neighbors are reversely magnetized. 
When the total intensity of the held is measured from a 
survey ship or aircraft, an alternating sequence of posi¬ 
tive and negative anomalies is observed (see Fig. 1.13), 
which can be interpreted in terms of the crustal magneti¬ 
zation. The anomalies can be correlated almost linearly 
between parallel prohles across a ridge system; conse¬ 
quently, the stripe-like anomalies are often referred to as 
magnetic lineations. 


5.73.2 Uniformity of sea-floor spreading 

Each anomaly in a set of magnetic lineations derives from 
a crustal block (or stripe) that formed at a ridge and was 
subsequently transported away from the spreading center. 
A magnetized crustal block forms during a period of sea¬ 
floor spreading when the geomagnetic polarity was con¬ 
stantly normal or reversed, and therefore represents a 
polarity chron or subchron. The width of a particular 
block depends on the duration of the chron and the 
spreading rate at the ocean ridge. 

The spreading rate can be determined easily close to a 
ridge (see Section 1.2.5.2). The edges of the magnetized 
crustal stripes correspond to the occurrences of polarity 
reversals, which can be correlated directly with the radio- 
metrically dated sequence for the last 3^1 Ma determined 
in lavas on the continents or islands. A plot of the dis¬ 
tance of a given polarity reversal from the spreading axis 
against the age of the reversal is nearly linear near the 
ridge; the slope of a best-fitting straight line gives the 
average half-rate of spreading at the ridge (see Fig. 1.15). 
This is half the full rate of plate separation, assuming that 
sea-floor spreading has been symmetric on each side of 
the ridge, which is often the case. 

Accumulated evidence from marine magnetic profiles 
allows us to assess the constancy of sea-floor spreading at 
different ridge systems. It is thought to have been uniform 
for the longest time in the South Atlantic. A plot of the 
distance to a given anomaly in the South Atlantic against 
the distance to the same anomaly in the Indian, North 
Pacific and South Pacific oceans contains several long 
linear segments, representing constant rates of sea-floor 
spreading in both oceans defining the line (Fig. 5.77). A 
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Fig. 5.77 Distances of 
anomalies from the ridge axis 
in the South Atlantic plotted 
against distances to the same 
anomalies from spreading 
centers in the Indian, North 
and South Pacific Oceans 
(after Heirtzler ef a/., 1968). 



Distance from the ridge (km) 


change in gradient indicates a change in spreading rate in 
one ocean relative to the other. The plot does not exclude 
a synchronous worldwide change of spreading rate in all 
oceans, but this would be a rather unlikely occurrence. 
Clearly the rate of sea-floor spreading changes from time 
to time, but for long intervals it is a remarkably constant 
process. 

5.73.3 The marine record of geomagnetic polarity history 

Investigations of magnetic anomalies in all major oceanic 
areas have given a clear, consistent record of the history 
of geomagnetic polarity during the past 155-160 Ma. It 
consists of two sequences of polarity reversals repre¬ 
sented by magnetic lineations and a long interval of con¬ 
stant normal polarity (Fig. 5.78). The sequences of 
chrons derived from magnetic lineations have been con¬ 
firmed by magnetostratigraphic research. 

The most prominent positive magnetic anomalies are 
numbered in turn from the youngest (anomaly 1 at an 
active ridge axis) to the oldest (anomalies 33-34 in the 
Late Cretaceous). The associated polarity chrons are 
identified by the same number and a letter to indicate the 
polarity. The polarity chrons in the latest sequence fall 
largely in the Cenozoic (see Fig. 4.2) and are identified by 
a leading letter C, which may be taken to stand for either 
“chron” or “Cenozoic.” The current Brunhes normal 
polarity interval corresponds to chron C1N; the reversed 
interval older than it is labelled chron C1R. Anomaly 2 
corresponds to the normal Olduvai event, which inter¬ 
rupts the reversed Matuyama interval and is identified as 
polarity chron C2N; the reversed interval older than C2N 
is called polarity chron C2R, etc. The current reversal 
sequence began in the Late Cretaceous. The oldest 
normal polarity chron in the sequence is C33N; it is pre¬ 
ceded by reversed polarity chron C33R, which ended a 
long interval (lasting about 35 Ma) in which no polarity 
reversals took place. This interval in which the geomag¬ 
netic field had a constant normal polarity is variously 
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Fig. 5.78 The geomagnetic polarity timescale since the late Jurassic, 
derived from the interpretation of marine magnetic anomalies and 
calibrated by coordinated magnetostratigraphy and biostratigraphy. The 
polarity record of the C-sequence anomalies for the past 85 Ma (CK95) 
was revised by Cande and Kent (1995); the M-sequence record from 
120Ma to 157 Ma (CENT94) is that of Channell ef a/. (1995). The record 
of reversals prior to about 155 Ma is uncertain. 


called the Cretaceous Quiet Interval, the Cretaceous 
Normal Polarity Superchron, or chron C34N. 

A phase of alternating polarity giving lineated mag¬ 
netic anomalies precedes the Cretaceous Quiet Interval. It 
began late in the Middle Jurassic and continued until the 
middle of the Early Cretaceous. These Late Mesozoic 
oceanic anomalies are referred to as the M-sequence. To 
distinguish them from the later sequence the numbered 
chrons are identified by a leading letter M. The youngest 
anomalies M0-M8 are numbered sequentially regardless 
of the magnetization polarity; older than M9 normal and 
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reverse polarity citrons are paired, as for the Cenozoic. 
The oldest securely identified chron in the sequence is 
M29N. Some older anomalies with low amplitudes have 
been interpreted as polarity chrons, but it has not yet been 
established that they represent polarity reversals rather 
than geomagnetic intensity fluctuations. 

The oldest regions of the modern oceans correspond 
to oceanic crust formed approximately 180 Ma ago, when 
Pangaea broke up and the current episode of sea-floor 
spreading was initiated. The marine magnetic anomalies 
over these areas have subdued amplitudes and they do not 
form lineations. Either no reversals happened during the 
period of initial spreading or the oceanic crust has not 
been able to retain the record. The character of the geo¬ 
magnetic field during this part of the Early Jurassic has 
not yet been definitively established. 

5.7.4 Geomagnetic polarity timescales 

The interpretation of marine magnetic anomalies pro¬ 
vides the most continuous and reliable record of geomag¬ 
netic polarity since the Middle Jurassic. The length of the 
record greatly exceeds the length of the securely estab¬ 
lished, radiometrically dated magnetic polarity timescale, 
which covers only about the past 5 Ma. Thus, it is not pos¬ 
sible to date most of the marine magnetic anomalies by 
direct correlation with a radiometrically dated polarity 
sequence. Knowledge of the spreading rate at a ridge 
system provides an alternative way of determining the age 
of the oceanic crust. Assuming that the rate of sea-floor 
spreading at the spreading center is constant, the age of a 
given anomaly can be computed by dividing its distance 
from the spreading center by the spreading rate. Elowever, 
this is an unsatisfactory method because the extrapola¬ 
tion is many times longer than the baseline. A further 
method of dating the polarity record is by establishing the 
same polarity sequence in sedimentary rocks that are 
dated paleontologically. This has been achieved in several 
investigations of the magnetic polarity stratigraphy in 
pelagic carbonate rocks. Using known absolute ages of 
major stage boundaries as tie-levels, the ages of magnetic 
polarity chrons that are too old to be dated directly are 
calculated by interpolation. 

5.7.4.1 Magnetostratigraphic calibration of polarity 
sequences 

The correlation of the radiometrically dated polarity 
sequence in continental lavas (Fig. 5.74 left) with the mag¬ 
netic anomaly record near to a spreading ridge was the 
original method used to date young marine magnetic 
anomalies. Subsequently, the polarity record was refined 
by detailed analysis of marine magnetic anomaly 
sequences. The polarity reversals were located in sedi¬ 
ments where they could be dated precisely by counting 
Milankovitch cycles in the sediments. The combination of 
improved marine magnetic record and astrochronological 
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Fig. 5.79 Crustal magnetization patterns for anomalies 29-34 
interpreted from magnetic profiles in the Indian, North Pacific and 
South Atlantic oceans, and comparison with the magnetic polarity 
stratigraphy in the Gubbio Bottaccione section (after Lowrie and 
Alvarez, 1977). K-T indicates the position of the Cretaceous-Tertiary 
boundary, which falls within reversed polarity chron C29R. 

dating gave a more reliable polarity timescale for 0-5 Ma 
(Fig. 5.74 right). 

The independent confirmation and dating of older 
marine magnetic anomalies required combined magne- 
tostratigraphical and paleontological studies in suitable 
rock formations. The dating of anomalies 29-34, which 
are the oldest found in the Cenozoic to Late Cretaceous 
sequence, illustrates this method. Along magnetic profiles 
in the Indian, North Pacific and South Atlantic oceans, 
the shapes of anomalies 29-34 are very different because 
of the different directions of the survey profiles and the 
orientations of the spreading axes. However, interpreta¬ 
tion of the anomalies gives nearly identical crustal mag¬ 
netization patterns (Fig. 5.79). 

The Scaglia Rossa pelagic limestone in the Umbrian 
Apennines of Central Italy was deposited almost contin¬ 
uously from the Late Cretaceous to the Eocene. Rock 
magnetic analysis showed that the limestone contained 
an easily defined stable component of characteristic 
remanent magnetization. Samples were taken at approxi¬ 
mately 0.5-1 m stratigraphic intervals in a long section 
through the limestone exposed in the Bottaccione gorge 
near Gubbio. The declinations and inclinations of the 
stable magnetization, after simple tectonic corrections, 
were used to calculate the latitude of the virtual geomag¬ 
netic pole (VGP) during deposition of the limestone. In 
times of normal polarity the VGP latitude is near 90°N, 
during reversed polarity it is near 90°S. The fluctuations 
of VGP latitude clearly define magnetozones of normal 
and reversed polarity (Fig. 5.80). The Gubbio polarity 
record in a 200 m thick section of pelagic limestone cor¬ 
relates almost perfectly with the oceanic polarity 
sequence derived from anomalies 29-34, measured in 
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Fig. 5.80 Magnetostratigraphy and biostratigraphy in the Bottaccione 
section at Gubbio, Italy (after Lowrie and Alvarez, 1977). 


different oceans on profiles that are hundreds of kilome¬ 
ters long (Fig. 5.79). The limestone magnetostratigraphy 
confirms independently this part of the oceanic magnetic 
polarity record. 

Paleontological studies in the Gubbio section gave the 
locations of important planktonic foraminifera zones 
and enabled the positions of the major stage boundaries 
to be located relative to the polarity sequence. The 
absolute ages of some important stage boundaries are 
known from independent radiometric and stratigraphic 
work. This enabled calculation of the absolute ages of the 
polarity reversals in the Gubbio section and, by correla¬ 
tion, the ages of the corresponding parts of the ocean 
floor. For example, the Santonian-Campanian boundary 
(with an age of about 83 Ma) lies close to the old edge of 
the reversed polarity chron C33R; the geologically impor¬ 
tant Cretaceous-Tertiary boundary (age 65 Ma) falls 
within reversed polarity chron C29R. 

In this way the geomagnetic polarity sequence in the 
Late Cretaceous and Paleogene has been confirmed in 
magnetostratigraphic sections, and the locations of many 
major and minor stage boundaries have been correlated 
to the polarity sequence paleontologically (Fig. 5.81). 
Reliable absolute ages are available for some of the stage 
boundaries, which can then be used as calibration levels. 
The ages of magnetic reversals between the dated tie- 
points are computed by interpolation, giving a numerical 
geomagnetic polarity timescale (see Fig. 5.78). In the 
same way, overlapping magnetostratigraphic sections of 
Early Cretaceous and Late Jurassic age have permitted 



Fig. 5.81 Confirmation and calibration of the oceanic magnetic 
reversal record in Paleogene and Cretaceous magnetostratigraphic 
sections in Umbria, Italy (after Lowrie and Alvarez, 1981). 


independent confirmation and dating of the M-sequence 
polarity record. 


5.7.42 Reconstruction of plate tectonic motions 

Once the ages of magnetic anomalies are known, a map 
showing the positions of dated anomalies is equivalent to 
a chronological map of the ocean basins (Fig. 5.82). The 
different rates of sea-floor spreading are evident from the 
separations of the isochrons. The oldest domains of the 
oceans (about 180 Ma) are found in the Atlantic ocean 
adjacent to the coastlines of North America and Africa, 
and in the North Pacific ocean. They are very much 
younger than the oldest rocks from the continents, which 
are up to 3.6 Ga old. The oceanic crust has been entirely 
produced since the onset of sea-floor spreading and the 
anomaly patterns reflect plate motions. 

The past motions of the major plates can be obtained 
in some detail from the anomaly ages given by a geomag¬ 
netic polarity timescale. The relative positions of the con¬ 
tinents at any time since the late Mid-Jurassic can be 
reconstructed by matching coeval marine magnetic 
anomalies formed at the same spreading center. The pro¬ 
cedure is similar to the reconstruction of supercontinents 
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Fig. 5.82 Map of the age of 
oceanic crust, as interpreted 
from marine magnetic 
anomalies (simplified after 
Scotese ef al., 1988). 



Fig. 5.83 Reconstruction of 
the history of opening of the 
North and Central Atlantic 
oceans. The figure shows the 
relative positions of Europe 
and Africa with respect to 
North America at specific 
times before the present (after 
Pitman and Talwani, 1972). 



by matching coastlines, 500 fathom depth contours or 
apparent polar wander paths. 

The method is illustrated by the sea-floor spreading 
between North America and Africa in the Central 
Atlantic (Fig. 5.83). Anomaly 33 forms a long stripe on 
each side of the Mid-Atlantic Ridge and subparallel to 
it. The anomaly is due to the magnetization contrast 
between chron C33N and the reverse polarity chron 
C33R which marks the end of the Cretaceous Quiet 
Interval. Its age is about 81 Ma. The anomaly on the 
west side of the ridge was formed at the same time as the 
anomaly on the east side, when the newly formed crust 
was magnetized at the ridge axis. If the African and 
North American plates are moved closer together until 


the east and west anomalies overlap, or until they match 
along their lengths with minimum misfit, the continents 
will be brought into the same positions relative to 
each other that they occupied around 81 Ma ago. By 
repeating this process of matching dated anomalies it is 
possible to reconstruct the successive relative positions 
of the African and North American plates as they 
separated. 

Marine magnetic anomalies in the North Atlantic can 
be used likewise to describe the relative plate motions 
between Europe and North America. A picture evolves 
of the separate histories of separation of Africa and 
Europe from North America. The differences between the 
European-North American and African-North American 
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Fig. 5.84 The number of geomagnetic polarity reversals per Myr during 
the last 160 Myr, computed in 4 Myr intervals from the composite 
reversal record in Fig. 5.78. 


plate motions permit the history of relative motion 
between Africa and Europe to be inferred. In the Late 
Cretaceous and Early Tertiary Africa moved eastwards in 
a giant shear motion relative to Europe, but since the 
Middle Tertiary the motion of Africa has been one of con¬ 
vergence and collision with the European plate. This is 
compatible with the formation of the Alpine fold belt and 
the present-day seismicity in the Alpine region. 

5.7.5 Frequency of polarity reversals 

Visual examination of the magnetic polarity record in 
Fig. 5.78 shows that the rate of polarity reversals has been 
quite variable in the last 160 Ma. A simple way to portray 
the reversal rate is to count the numbers of reversals in 
successive bins of equal duration (e.g., 4 Myr) and 
compute the average number per Myr. The plot of rever¬ 
sal rate against age (Fig. 5.84) is uneven, but has some dis¬ 
tinct features. 

The reversal frequency during the M-sequence aver¬ 
aged about 3-4 reversals/Myr. From about 130 Ma ago 
the reversal rate slowed down until the beginning of the 
Cretaceous Normal Polarity Superchron (CNPS) at about 
121 Ma ago. There were no reversals for the following 38 
Myr. After the end of the CNPS about 83 Ma ago the 
reversal rate was at first very low, but gradually speeded 
up. It reached a peak of about 5 reversals/Myr in the Late 
Miocene, about 10 Ma ago, and has since decreased 
slightly to about 4 reversals/Myr. The current interval of 
normal polarity (known as the Brunhes or chron C1N) 
has lasted 0.78 Myr, which is much longer than the mean 
length of polarity chrons in the Late Tertiary. 


In 1968 A. Cox theorized that reversals are random 
events, triggered by unknown mechanisms that affect the 
fluid motions in the Earth’s liquid outer core. With a 
random reversal process there would be no continuity 
between successive reversals; as soon as a reversal was com¬ 
pleted, the next one would be as likely to occur immediately 
as at any time later. Because there would be no waiting 
period, this type of mechanism would generate a large 
number of very short polarity chrons and a small number 
of long chrons. The frequency distribution of polarity 
chron durations should decrease exponentially with 
increasing chron duration. In fact this model does not fit 
the observed lengths of polarity intervals precisely. The 
polarity sequence contains comparatively few short chrons, 
a lot of medium duration chrons and few long chrons. This 
may imply that the process that causes reversals is not com¬ 
pletely random. A distinct length of time may elapse after a 
reversal for the fluid motions to recover sufficiently to allow 
the next reversal to happen. However, the mechanism that 
causes a reversal is inadequately understood. 

5.7.6 Early Mesozoic and Paleozoic reversal history 

Because of the availability of the excellent marine magnetic 
anomaly record, it has been possible to construct a well 
dated history of geomagnetic polarity in the last 155-160 
Ma (see Fig. 5.78). The determination of a geomagnetic 
polarity timescale for eras older than the Middle Jurassic is 
more complicated, because no comparable oceanic record 
exists. Paleomagnetic results show numerous reversals 
during the Triassic, but the Permian and Late Carbon¬ 
iferous were dominated by reversed polarity. A long inter¬ 
val of constant reversed polarity - the Kiaman interval - is 
a distinctive feature of the Paleozoic polarity record. 
Earlier in the Paleozoic reversals were common. Although 
magnetostratigraphic investigations of many formations 
are in progress, no unique record of the polarity succession 
is available for the Early Mesozoic or Paleozoic. 

At present it is only possible to analyze Paleozoic polar¬ 
ity history in terms of the bias toward normal or reverse 
polarity. The polarity bias broadly defines superchrons 
(Fig. 5.85). When as many reversed as normal magnetiza¬ 
tions are found it is assumed that the field polarity has been 
reversing; the interval is called a mixed polarity superchron. 
The Late Cretaceous and Cenozoic and the Triassic illus¬ 
trate mixed polarity superchrons. Sometimes, for unknown 
reasons, the polarity was constant for long intervals. This 
was the case during the Cretaceous Quiet Interval, which in 
terms of polarity bias is called the Cretaceous Normal 
Polarity Superchron. The Kiaman reversed polarity inter¬ 
val is the same as the Permo-Carboniferous Reversed 
Polarity Superchron. The derivation of a more detailed 
history of geomagnetic polarity for Early Mesozoic and 
Paleozoic time is a massive task for paleomagnetists and 
biostratigraphers. Because of the absence of a marine 
magnetic record each polarity sequence will have to be 
confirmed by repetition in several magnetostratigraphic 
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Fig. 5.85 Geomagnetic polarity bias superchrons in the Paleozoic, 
Mesozoic and Cenozoic (after Harland etal., 1982). 


sections before it can be accepted as globally significant 
and representative of dipole field behavior. 
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5.9 REVIEW QUESTIONS 

1. What is the evidence that the geomagnetic field origi¬ 
nates inside the Earth? 

2. Why is the geomagnetic field at the Earth’s surface 
mainly a dipole field? 

3. What is the non-dipole geomagnetic field? Flow large 
is it compared to the dipole field at the Earth’s 
surface? Is its importance relative to the dipole field 
greater or less at the core-mantle boundary? 

4. What is the magnetosphere? How does it originate? 

5. What are the Van Allen belts? How are they formed? 

6. Why do electrically charged particles from the solar 
wind have curved paths in the Earth's magnetic 
field? 

7. The geomagnetic field has a large anomaly over the 
South Atlantic, where its intensity is weakened by 
about 20%. What effects could this anomaly have on 
extra-terrestrial radiation that impinges on the 
Earth? What might be the consequences for (a) 
Earth-orbiting satellites, (b) astronauts, (c) passen¬ 
gers and crew in high-flying aircraft. 

8. What is the ionosphere? What effect does it have on 
the magnetic field measured at the Earth’s surface? 

9. What is a magnetic storml What is its cause and what 
are its effects? 

10. Describe the principle of operation of (a) the flux- 
gate magnetometer, and (b) the proton-precession 
magnetometer. 
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11. Which corrections must be made to magnetic survey 
measurements in order to define a magnetic anomaly? 

12. Describe how the magnetic anomaly of a very wide, 
vertically magnetized thin sheet varies with position 
from one side of the sheet to the other. Where is the 
anomaly largest and where is it smallest? 

13. What are diamagnetism, paramagnetism, ferromag¬ 
netism, ferrimagnetism, and antiferromagnetism? 

14. Which type of magnetism is exhibited by (a) quartz, 
(b) calcite, (c) clay minerals, (d) magnetite, and (e) 
hematite? 

15. What is remanent magnetization? What types of rema¬ 
nent magnetization are possible in (a) sedimentary, 
(b) metamorphic, and (c) igneous rocks? Explain how 
they originate. 

16. Explain what is wrong with the following explanation 
of the origin of thermoremanent magnetization 
(TRM): “When cooled below the Curie point the 
magnetite grains align with the magnetic field to give 
a TRM.” 

17. What are (a) the declination and (b) the inclination of 
the magnetic field at a given location? What is the direc¬ 
tion of the magnetic field at the magnetic equator? 

18. What is implied if the inclination of the remanent 
magnetization of a rock differs from the inclination 
of the magnetic field at the same location? What is 
implied if the declinations differ? 

19. EIow might a rock acquire more than one component 
of remanent magnetization? 

20. EIow can the original direction of remanent magneti¬ 
zation be identified in a rock that has more than one 
magnetization component? 

21. What is the geocentric axial dipole (GAD) hypothesis 
and why is it important for paleomagnetism? 

22. What is an apparent polar wander (APW) path? Why is 
it an apparent path? How are APW paths for different 
continents interpreted in terms of global tectonics? 

23. What is magnetic polarity stratigraphy? How is a 
magnetic polarity stratigraphy calibrated for young 
rocks and for old rocks? 

24. How do oceanic magnetic anomalies originate? 
Explain the survey procedure that might be used to 
measure them. 

25. What is a geomagnetic polarity timescale (GPTS)? 
How is a GPTS constructed and calibrated? 


5.10 EXERCISES 

1. Assuming that the geomagnetic field corresponds to 
a geocentric axial dipole, calculate the latitude of a 
site where the field inclination is 45°. 

2. With the same assumption, calculate the inclination 
of the geocentric axial dipole field at latitude 45°N. 

3. The magnetic moment of the Earth’s geocentric 
dipole is 7.7674X 10 22 A m 2 . Assuming an axial 


dipole, calculate the total geomagnetic field intensity 
as a function of latitude. What is the latitudinal varia¬ 
tion of the total field in nT km -1 at 30°N? 

4. Compute the inclination and declination of the mag¬ 
netic field that would be observed in Boulder, 
Colorado (40°N, 105°W) if the Earth’s field corre¬ 
sponded to a perfect geocentric dipole whose axis 
penetrates the Earth’s surface at 80°N, 72°W. 

5. Show that, for a small displacement along a meridian 
of magnetic longitude at magnetic latitude 45°N, the 
change of inclination is exactly 4/5 the change in lati¬ 
tude. 

6. The north magnetic pole is at 77°N 102°W, the south 
magnetic pole is at 66°S 139°E. 

(a) Why are the poles not antipodal (exactly opposite)? 

(b) What is the closest distance between the center of 
the Earth and the straight line joining the poles? 

7. Measurements of the magnetic field elements at a 
geomagnetic observatory gave the following results: 
N-component 27,000 nT; E-component -1800 nT; 
V-component -40,000 nT. 

(a) Is the observatory in the northern or southern 
hemisphere? 

(b) What is the total field intensity at the site? 

(c) What are the local values of inclination and decli¬ 
nation of the field? 

8. The IGRF for 2005 gives the values of the Gauss 
coefficients for the dipole and quadrupole compo¬ 
nents of the geomagnetic field shown in the following 
table. 


Gauss coefficients [nT] 

*i 

-29,557 

g°2 

-2,341 

?! 

-1,672 

g\ 

+ 3,047 

h\ 

+ 5,080 

h\ 

-2,595 



si 

+ 1,657 



/?2 

-517 


The mean squared value, R n , of the intensity of the 
geomagnetic field at the Earth’s surface due to the 
component of degree n is given by 

*,,=(«+^sfc^+w) 2 ) 

m=0 v ’ 

(a) Calculate the root mean square intensity of the 
dipole component of the field at the Earth’s 
surface. 

(b) Calculate the corresponding intensity of the 
quadrupole component and express it as a per¬ 
centage of the dipole field intensity. 

9. The values of the Gauss coefficients in the previous 
exercise are given for the Earth's surface. Recalculate 
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the root mean square intensities of the dipole and 
quadrupole field components at the core-mantle 
boundary (Earth's radius 6371 km, core radius 
3485 km). 

10. What are the values of n and m in the designation T" 1 
for the spherical harmonic functions illustrated in 
Fig. B5.3.2 in Box 5.3? Sketch how these patterns 
would appear on the opposite side of the reference 
sphere to the one you are looking at? 

11. In an aeromagnetic survey at a flight altitude of 2000 
m above sea-level, the maximum total field anomaly 
over an orebody was 30 nT. In a repeated measure¬ 
ment at 2500 m altitude the maximum amplitude of 
the anomaly was 20 nT. Calculate the depth of the 
orebody below sea-level assuming (i) a monopole 
source and (ii) a dipole source for the anomaly. 

12. The vertical field magnetic anomaly A B_ over a verti¬ 
cally magnetized anticline (represented by a horizon¬ 
tal cylinder) is given by Eq. (5.54). Draw a sketch of 
the anomaly on a profile normal to the structure. 
Observe the horizontal positions where the anomaly 
is zero. 

(a) Calculate the horizontal positions where the 
anomaly has extreme values. 

(b) Calculate the peak-to-peak values of the anomaly. 

13. Assume that the core of an anticline is made of 
basalt, the host formation is limestone, and the rocks 
are vertically magnetized with susceptibilities given 
by the median values in Fig. 5.13. 

(a) Compute the induced magnetization contrast 
when the vertical magnetic field intensity is 
40,000 nT. 

(b) If the anticline is modelled by a horizontal cylin¬ 
der whose radius is one fifth the depth of its axis, 
calculate the maximum amplitude of the vertical 
field anomaly over the structure. 

14. Assuming the gravity anomaly over an anticline 
as given by Eq. (2.94), apply the Poisson relationship 
(Box 5.5) to obtain the horizontal field magnetic 
anomaly A B x of a vertically magnetized anticline 
with radius R and magnetization contrast AM.. 

15. The north (AO, east ( E) and vertical ( V) components 
of a magnetization can be calculated from its inten¬ 
sity (M), declination (D) and inclination (/) using the 
following relationships: 

N = McosIcosD 
E = Mcos/sinD 
V= Msin/ 

In the progressive thermal demagnetization of a 
sample of Cretaceous limestone for a paleomagnetic 
study the remanent magnetizations given in the table 
were measured at different temperatures T. 


T[°C] 

M[10~ 5 Anr 1 ] 

D[°] 

ni 

200 

5.08 

60.8 

60.1 

300 

3.87 

62.1 

59.8 

400 

3.02 

61.2 

62.1 

500 

2.10 

62.2 

61.6 

550 

1.18 

63.1 

60.9 


(a) Calculate the north (AO, east ( E) and vertical ( V) 
components of the magnetizations at each 
demagnetization stage. 

(b) Plot the A-components against the .E-components, 
fit a straight line, and determine the optimum 
declination for the stable magnetization direction. 

(c) Plot the F-components against either the A- or 
.E-components, fit a straight line, and compute 
the optimum inclination for the stable magnetiza¬ 
tion direction. 

(d) The straight lines do not pass through the origin 
of the plot. What does this imply? 

16. In the same paleomagnetic study, the following stable 
directions of remanent magnetization, corrected for 
the dip of bedding in the limestone formation, were 
measured in five samples from the same site: 


Sample 

D[°] 

I[°] 

SR-04 

329.7 

40.6 

SR-05 

336.6 

24.7 

SR-07 

326.2 

46.0 

SR-10 

321.1 

40.9 

SR-12 

322.7 

44.9 


(a) Calculate the direction cosines (A^, \ E , A v ) of the 
north, east and vertical components of each 
stable direction, using the relationships 

\ N = cos/cos D 
\ E = cos/sin D 
A v = sin/ 

(b) Add up the values for each direction cosine. Let 
the sums be X, Y and Z, where 

z=^ r 

Calculate the vector sum of the directions, R, and 
the declination D m and inclination I m of the 
mean direction using the relationships 

R = a/Z 2 + Y 2 + Z 2 

tan D m = Y/X sin I m = Z/R 

(c) Using the computed value of R , calculate the 
precision parameter k of the data and the 95% 
confidence error (a g5 ) for the mean direction. 
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17. The samples analyzed in the previous exercise were 
gathered at a site in Italy with latitude 43.4°N, longi¬ 
tude 12.6°E. Calculate the latitude and longitude of 
the paleomagnetic pole position for the Italian lime¬ 
stone. 

18. Assume that the location of the Late Cretaceous 
paleomagnetic pole for the European plate is at 72°N, 
154°E and the corresponding pole for the African 
plate is at 67°N, 245°E. 


(a) Calculate the expected ‘European' and African’ 
directions at the Italian site in the previous exer¬ 
cise. 

(b) Compare the expected directions with the 
observed directions and explain how the paleo¬ 
magnetic results from the Italian limestone 
should be interpreted. 



Appendix A The three-dimensional wave equations 


Derivation of general equations of motion 


The development of the equations for P-waves and S-waves 
in three dimensions begins like the one-dimensional cases. 
In this case, however, the displacement of a particle during 
the passage of a seismic wave is a vector U with compo¬ 
nents u, v, and w in the x-, y-, and z-directions, respectively. 

Consider the forces acting on the faces of the small 
box shown in Fig. A1. The faces normal to the x-, y-, and 
z-directions have areas A x , A y , and A„ respectively. 
Let the force on the left face of area A x , acting in the x- 
direction, be (F x ) x and that on the right face be ( F x ) x + 
MF X ) X . If these forces are equal (A(i 7 v ) T = 0) they will 
cause no motion in the x-direction. If they are unequal, 
motion will result. If the box is small enough for the 
change in force to take place uniformly, we can write 


A (F x ) = 


S(F X ), 

dx 


Ax = 


d ( (r xAx) 

dx 


dcr. 


Ax = (AyAz)Ax 

(Al) 


The forces acting on the faces normal to the z- 
direction also have x-components. Treating them in the 
same way, the force on the base of the box of area A z , 
acting in the x-direction, is ( F,) x and that on the top face 
is (F_) x + A(F z ) x . The difference is 


d(F.) 

A (*■*), =-W' 


Z’X A d AzAz) A 8 ^ZX / A A A A 

Az = ^— Az = (AxAy) Az 


(A2) 


Analogously (not shown in the figure), the resulting 
force in the x-direction front the stresses on the faces 
normal to the y-direction is given by 


A (F y ) x 


HF V ) X 


Sy 


'-Ay 


dv 


Ay 


8a vx 

-^-(AzAx)Ay 

(A3) 


Combining these results, we get the net force AF v 
acting on the box in the x-direction: 


A F = 


d(T xx 

dx 


d<r y* 

dy 


d(T z> 

dz 


AV 


(A4) 


where A V= Ax Ay Az is the volume of the elementary box. 
If the density of the medium is p and the x-component 
of the displacement is u, we can apply Newton’s law of 
motion to obtain the equation of motion 



box. 


da xx 

dx 


dcr. 


dox 


,d 2 U 


~dy + ~dz i JAF= pAF^f 


da 


XX 

dx 


dy 


■ + ■ 


d fjx 

dz 


d 2 U 

" P 1A 


(A5) 


From the definitions of the Lame constants (Section 
3.2.4.3) and the relationships between stress and strain 
(Eq. (3.32)), we have 


\6 + 2fj,e xx 



dv , dw \ 
dy dz) 


+ 2/jl 


du 

dx 


2 ^Syx 



du\ 

dy) 


(A6) 


2/J-e- v = M 1 


dW 

dx 


du \ 

dz) 


Differentiating each of these equations in turn, as in 
the equation of motion, gives 


d(T xx 

dx 


= A 


d6 . 0 d 2 u 

dx + ^dx 2 
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><r yx _ / d 2 v d 2 u \ 

dy ~^\dydx + dy 2j 

lfr -x _ ( d 2 w . d 2 u\ 

dz ~ p '\dzdx + dz 2 ) 


(A7) 


(A + p) 


/ d 2 e d 2 e d 2 ^ 

\dx 2 dy 2 dz 2 ) 


+ f T2 (S 


dv 

By 



_ dy(du,dy,dw\ 
p df\dx dy dz) 


(A15) 


After substitution and grouping terms, the equation of 
motion for the .v-displacement u becomes 


A 


dd 

dx 


+ 2p 


d 2 U 

dx 2 



d 2 u ) , ( d 2 W 

dy 2 ) dzdx 


d 2 u \ _ d 2 u 
dz 2 )~ P dt 2 


(A + p)V 2 0 + pV 2 0 - p jjy 
(A+2p)V 2 0 = p|f 
which can be written as 


(A8) 


, <90 , (d 2 U . d 2 V , d 2 W \ . / d 2 U . d 2 li . d 2 ^ 

ax + ^W 2 + aj-a.Y + dzdx) + ^W 2 a.v 2 az 2 / 


“ 2v!e =§ 

where 


(A16) 

(A17) 


(A 18) 



(A9) 


A + 2/x 
P 


(A 19) 



a /aw ay ary \ 
p dx\dx + + flr / 



a^M , a^w'l _ 
ay 2 ar 2 / ~~ p at 2 


(A10) 


It is convenient to introduce the mathematical opera¬ 
tor V 2 = d 2 /dx 2 + d 2 /dy 2 + d 2 /dz 2 . This allows a more 
concise presentation of Eq. (A 10) for the ^-displacement, 
which becomes 

(A+/x)|| + /xV 2 M = p|^ (All) 

Similar analysis of the y- and z-displacements, v and w 
respectively, gives the additional equations of motion: 


Comparison with the one-dimensional compressional 
wave equation (Eq. (3.41)) shows that Eq. (A18) describes 
the three-dimensional propagation of a volumetric 
change (dilatation) 6 with velocity a given by Eq. (A19). 
This wave travels as a succession of dilatations and com¬ 
pressions parallel to the direction of propagation and is 
referred to as a longitudinal wave. Because it is the first to 
arrive at a seismometer it is more commonly called the 
primary wave or P-wave. 

Equation of transverse waves 

By differentiating Eq. (All) with respect toy we get 


(A+p-)ff+ p-V 2 v = p|^| (A12) 

(A + p) 1 °, + pV 2 w> = p (A13) 

Equations (A11)-(A13) can now be resolved into two 
wave motions representing longitudinal (P-) waves and 
transverse (S-) waves. 


(A + p) 


d 2 9 

dydx 





(A20) 


Likewise, differentiating Eq. (A12) with respect to x gives 


(A + p) 


d 2 9 

dxdy 





(A21) 


Equation of longitudinal waves 

Differentiating each of Eqs. (All), (A12) and (A13) in 
turn by x, y, and z, respectively, gives 






(A + p) 


d 2 9 

dv 2 





(A14) 


Subtracting Eq. (A20) from Eq. ( A21) we get 



(A22) 


By combining Eq. (A12) with Eq. (A13), and Eq. 
(A13) with Eq. (All), in the same way, we obtain the 
further wave equations 


(A + p) 


d 2 9 

dz 2 






(A23) 


Adding these equations together, and using the defini- y 2 (dw_dy] = d 2 ( dw dv j 
tions of V 2 given above and 9 = du/dx + dv/dy + dw/dz we ^ dz/ p at 2 \dy dz) 

obtain 


(A24) 
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which can be written as 

|8 2 V 2 ^=^ (All) 

(A25) where 


Define 

= (W|y] 

* \ dy dzj y \ dz dx) 

- \dx dy) 


The quantitities dv/dz, etc., are equivalent to small 
rotational distortions that combine to give shear strains 
(see Section 3.2.3.3). Thus (dv/dz - dw/dy) is a net rota¬ 
tion in the y—z plane, i.e., normal to the .v-direction. The 
quantities defined in Eq. (A25) are the components of a 
vector , 'P' = (/'P v ,T f v ,T f ,). Each component of 'P describes 
a net rotation in the plane normal to the specific coordi¬ 
nate axis. 

Equations (A22)-(A24) show that the components of 
T satisfy the equation 

M V 2 T=p^f (A26) 


/3 2 = g (A28) 

Equation (A27) has the form of the wave equation. It 
describes the propagation of a rotational or shear distur¬ 
bance 'P with velocity /3 given by Eq. (A28). Because the 
displacements in this wave are in the plane normal to the 
direction of propagation it is referred to as a transverse 
wave. Its velocity /3 is slower than the P-wave velocity a, 
so it arrives at a detector later than the P-wave from the 
same event and is called the secondary wave, or S-wave. As 
the disturbance consists of shear motions of particles in 
the wavefront, it is also called a shear wave. 



Appendix B Cooling of a semi-infinite half-space 


A semi-infinite half-space extending to infinity in the z- 
direction is initially at a uniform temperature T (y Let the 
temperature at time 1 and depth z inside the cooling half¬ 
space be T(z,t). The temperature of the upper surface at 
z =0 is changed suddenly and maintained at 0 °C. The 
boundary conditions are therefore T(0,t) = 0 and T(z,0) = 
T 0 . The temperature in the cooling half-space must satisfy 
the heat conduction equation, which can be written 
(Box 4.1) 


where the integration variable is changed to avoid confu¬ 
sion. Substituting in Eq. (B5) we get 


T(z,t) =^ T(£) e K " 2 'sin(«z)sin(«£)dH 


d£ (B8) 


Invoking the trigonometric relationship 2sin(nz)sin(«£) = 
cos(«(£ — z)) - cos(«(£ + z)) changes the integral in Eq. 
(B8) to 


1 1 dfl _ 1 d 2 Z 
K 6 dt Z dz 2 


(Bl) 


Separating the variables and setting the separation con¬ 
stant equal to -n 2 , we get 


nz,t)=^T{c)[] 


e k " 2 '(cos («(£ —z) 


- cos (»(£ + z))) d n\ d£ 


(B9) 


11 d0 

«6 df 


and 


1 d 2 Z 
Z dz 2 


The integration can be simplified by writing Kt = a, 
(B2) - z) = u, and (£ + z) = v. 


The particular solutions of these equations are 
6 = 0 O e " and Z = A n cos nz + B n sin nz 


(B3) 


T(z,t) = ^ j T(£) je “" 2 cos(«w)d« 


The boundary condition on the upper surface at z = 0 is 
T(0,r) = 0, which requires A n = 0. The general solution 
can then be written 

T(z,t) = e “ Knl 'B n sin nz (B4) 


If the temperature distribution is continuous, the summa¬ 
tion can be replaced by an integral and the discrete values 
(fl 0 , B n ) by a continuous function B(n) 


— e “"cos (nv) d// 




(BIO) 


Before proceeding with the solution of Eq. (BIO) it is first 
necessary to evaluate the integral 


7 = 


e “" 2 cos(«m) dn 


(Bll) 


Note that 


T(z,t) = J e Knl, B(ri) sin (nz) An 
o 


(B5) 


^~= J (~fie a " 2 ) sin (mi) An 
o 


(B12) 


If at t = 0 the cooling half-space has an initial tempera¬ 
ture distribution T(z), 

T(z, 0) = T(z) = J B(n) sin (nz) d n (B6) 

0 


Integrating Eq. (B12) by parts gives 


dY_ 

du 


2a 


- sin (nu) 


Jo 


u 

2 a 


e anl cos (nu) d n = 


_ U y 

2 a 


(B13) 


By using the orthogonality properties of sine and cosine 
functions (see Box 2.3), we get 

b( B) = lj T’(z) sin (nz) dz = ^ j 7Y£)sin(«£)d£ (B7) 

0 0 


/va— u 

Ydu ~ du ln( Y) ~ 2a 

ln ( y)= “£ + C= “£ + ln( ^°> 
7= 7 0 e" 2/4 “ 


(B14) 

(B15) 

(B16) 
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The constant Y 0 is the value of the integral Y defined in 
Eq. (B11) for u = 0. This integration (evaluated below) 
has the value 

Y 0 = je-“» 2 d« = i^ (B17) 

0 

Thus Y= |e _ “" 2 cos(«M)d ;7 = ^'^^e _ “ 2/4 “ (B18) 

o 

Inserting this solution in Eq. (BIO) gives 


The expression in brackets is the error function (Box 4.3), 
defined as 


erf ( 17 ) 



e " 2 d« 


(B24) 


The solution for the temperature distribution in the 
cooling plate is therefore 


T(z,t) = 7^ erf 



(B25) 


0 

ZV'TVKtJ 

' n 


Q — u 2 /4a Q — v 2 /4a 


d£ 


*-(£— z) 2 /4/ct _ e — (£+z) 2 /4ict 


d<T (B19) 


If the cooling body has initially a uniform constant tem¬ 
perature T 0 , then T(z) = T 0 and the integration in Eq. 
(B19) can be written 


T(z,t) = 


l^TTKt 


e -(z-o 2 /4 K?df- e ^ t+ n 2/4K 'dn (B20) 


In the first integration we can write w = (f — z)/2^i<t\ then 
dvr = l/2^[i<td£ and the upper and lower limits of the inte¬ 
gration change to °° and — z/2a/k 7, respectively. Similarly, 
in the second integration we can write w = (£ + z)/ 2 a/k?, 
with corresponding changes to the integration limits, 
which become 00 and z/2^Kt, respectively. Equation (B20) 
can be rewritten as 


T(z,t) 


To 

■\[tt 



e~ w2 dw 


—z z 

2^Kt 2-^Kt 


(B21) 


Evaluation of the integral Y 0 in Eq. (B17) 

Y o = J e““ A 2 d.v = I e-“J' 2 dy (B26) 

0 0 

(T 0 ) 2 = ^ Je _ “* 2 dx j ^ J e~ a}i2 dy j = J J e _ “(* 2 + J' 2 )dxdy 

0 0 00 

(B27) 


This is an integration over an area enclosed by the posi¬ 
tive x- and y-axes ( 0 si<oa; 0 <y<co). Change to polar 
coordinates x = r cosi p and y = r sin<p and write the 
element of area as (dx dy) = (r dr dip). The limits of the 
integration change to 0 < r < 00 ; 0 ^ <p=s (tt/2 ): 


(T 0 ) 2 = J je ar2 rdrd<p= j je m 'Vdr 
0 0 

X) 

dcp 


d cp 


0 0 

TT/2f 


I 

0 

-rr /2 


a-ar 2 

2a 


p — ar z 00 1 

( r o) 2 =j[-Vl^ = ijdp 


TT 

4a 


(B28) 

(B29) 


The value of the integral in Eq. (B17) is thus 



(B30) 


(B23) 
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Earth 4,5 
age estimation 
geochronology 207-220 
history 208-210 

cooling 210,221,231-233,243-244 
creep mechanisms 110 
density 47—48,186 

radial variation 192-193 
as dynamic planet 15-40 
early 220 

electrical conductivity 274—275 ,276 
electrical properties 255-256 
surveying 256 
elliptical orbit 3B, 7, 14 
and Milankovitch cycles 59-60 
external heat 230-231 
figure 57, 61-62, 65 
formation 220 
free oscillation 137-140 
gravitational potential 62-63 
gravity, radial variation 193 
heat 220-252 
flow 229-252 
sources 227-230 
transport 224-227 
imperfect elasticity 128-130 
internal structure 20-21, 20, 186-201 
history 186-187 
models 193-197, 194 
internal temperature 222-224 
International Reference Ellipsoid 61, 65 
magnetic field see geomagnetism, Earth’s 
magnetic field 
mass 8,47 
obliquity 59 

polar flattening 44-45, 61-62 
pressure, radial variation 193 
rotation 7, 14 , 48-61, 62 
changes 54-60 
seismicity 27,150-151 
shape 44-45, 61, 62 
size 43-44 

Earth-Moon barycenter 50-52 ,51 
Earth-Moon distance 8, 56-57,210 
Earth-Sun distance 1-2 ,1 
Earth-tides 50, 54 
earthquakes 

Alaska (1964) 160,162, 164 
at constructive margins 26 
Chile (1960) 160,140, 164 
control 169-170 


damage 160 
elastic deformation 123 
energy release 166-167,221 
epicenter 149-150 
fault-plane solutions 153 
focal depth 149,150 
focal mechanisms 151-158 
active plate margins 155-157 
continental collision zones 157-158 
subduction zones 156-157 
free oscillation 137-140 
frequency 166 

intensity 160-162, 161 , 165-166 
intraplate 150 

Kamchatka (1952) 140, 164 
magnitude 162-166, 164 
Mexico (1985) 162, 164 
prediction 167-169 
San Francisco (1906) 160,162, 164 
seismology 148-171 

Sumatra-Andaman Islands (2004) 140, 141, 
158,159,160, 164 , 165 
Tangshan (1976) 148, 164 
zones 21-22,27 
eccentricity, orbital 59 
eclipses 55-56 
ecliptic plane 3B, 4 
eikonal equation 134 
elastic constants 126 
elastic limit 107,122 
elastic rebound model 148 ,148 
elasticity theory 122-130 
electric current 253 
electric field 253-254 
Earth 255-256 
electric potential 253 
electricity 252-276 
electrode polarization 266 
electrodes, potential 260-262 
electrolyte 255 

electromagnetic induction 269-271 
electromagnetic surveying 268-274 
electron 253 
conduction 225 
capture 215 

elevation correction, gravity measurement 78, 

80 

ellipse 3B 
ellipsoid 

oblate 44-45, 61-62, 65-67 
International Reference Ellipsoid 61, 65 
ellipticity, dynamical 58,62 
Emperor Seamount volcanic island chain 32-33, 
33 

energy, potential 45^46 
energy levels 316, 323-324 
entropy 222 
Eotvos acceleration 60 
ephemeris time 208 
epicenter 149-150 
equinoxes 3B, 208 
precession 58, 59 
equipotential surface 48 
Eratosthenes (275-195 BC), circumference of 
Earth 43 
Eris 12,13B 
Eros 9 

error function 234B, 235 
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Euler, Leonhard (1707-83) 34 
nutation 57 

spherical trigonometry 34, 35B 
Euler pole of rotation 28,29, 34-38, 346-347 ,347 
Europa 9,10 

Ewing piston corer 238 ,239 
expansion, volume coefficient 222 
explosions, nuclear, monitoring 170-171 

Faraday, Michael (1791-1867) ‘magneto-electric’ 
induction 282-283 
Faraday’s law 268 
Farallon plate 31, 32 
faults 

dip-slip 154 
listric 154 

normal 149, 154 ,155 
overthrust 154 
reverse 154 ,155 
strike-slip 154 
thrust 154 
transcurrent 155 

transform 22, 28-29,148,155, 155,156 
earthquake focal mechanisms 155-156 
transform force 38 ,39 
see also triple junctions 
faulting, mechanisms 154-155 
Fermat’s principle 173-174 
ferrimagnetism 289, 290, 291, 292 
in rock 293-294 
grain-size dependence 296-299 
minerals 296 
ferromagnetism 289,290 
Curie temperature 291 
parasitic 291 
flow 

heat see heat, flow 
laminar 106-107 
in liquids 106-107 
plastic 110 
in solids 107 
viscous 106-107 
fluid, Newtonian 107,110 
flux 

heat 222,225 

magnetic flux density 281, 283 
foreshock 148 

Fourier, Jean Baptiste (1768-1830), heat 
conduction 230 
Fourier analysis 86B 
Fourier integral 88, 89B 
Fourier series 86B, 87 
Fourier transforms 88-89, 89B 
fracture zones 22,28 
Franklin, Benjamin (1706-90), studies in 
electricity 252 
Fraunhofer diffraction 190 
free-air gravity anomaly 83-84 
free-air gravity correction 78,79-80 
free-fall gravity measurement 74-75 
Fresnel diffraction 190 

7 -rays 211,228 

Galileo Galilei (1564-1642) 2-3,9,10,11 
gamma-gamma logging 81 ,82 
Ganymede 9 

Gauss, Carl Friedrich (1777-1855), Earth’s 
magnetic field 283, 306, 309 


Gauss coefficients 309 
geochemical anomaly, hotspot volcanism 34 
geochronology 207-220 
geodesy 61 
satellite 67-73 
geodynamo 315-316 
geoelectricity 252-276 
geoid 66-67 

satellite measurement 70-71 
undulations 67, 69, 70 
geology, timescale 208 ,209 
geomagnetic dynamo 315-316 
geomagnetic pole 311 
apparent polar wander 18, 37-38 
multipole 310B 

virtual 18, 336-337, 344-345, 345B, 350, 355 
geomagnetic potential, multipole expression 309, 
310B 

geomagnetism 305-320 
Earth’s magnetic field 
altitude variation 326 
dipole 311-312 
secular variation 313 
diurnal variation 307-309, 326 
external 305-309 
internal 309, 310-312, 314-316 
latitude and longitude variation 326 
multipole 310B 
non-dipole 312, 314 
origin 314-316 

polarity change 23-24, 316, 349-359 
secular variation 312-314, 335-336 
time-averaged 334-335 
see also paleomagnetism 
geomagnetometry 309 
geopotential 63,67 
Giant Impact hypothesis 220, 317 
Gilbert, William (1544-1603) studies in 
magnetism 282 

Global Depth and Heat Flow model 239, 244, 
245B 

global positioning systems 69-70,221 
Goddard Earth Model 67 
goethite 295,302 
Gondwanaland 16 
computer reconstruction 17 ,18 
paleomagnetic reconstruction 19, 348-349 ,349 
gradiometer, magnetic 325 
granite, radiogenic heat 228 
gravimeters 76-77, 76,82 
gravimetry, borehole 81-83 
gravitation 45^18 
acceleration 46^47 
constant 4,45 

law of universal gravitation 3-4,44,45-46, 50 
potential 46-47 
gravity 

absolute measurement 74—76 
free-fall 74-75 
rise and fall 75-76 
anomalies 73-99 
at ocean ridges 27 
at subduction zones 28 
Bouguer 83-84, 90, 95-99,103-104 
continental and oceanic crust 95-96 
double Fourier series 87B 
filtering 87-89, 90 
Fourier analysis 86B 


free-air 83-84,240 
interpretation 84—99 
isostatic 103-105 
modelling 89-94 
mountain chains 96-97 
oceanic ridge 97-99 
regional 84-85,95-99 
residual 84—85 
subduction zone 99 
measurement correction 77-80 
normal 63-66 
potential 63, 65 
radial variation 193 
relative measurement 76-77 
satellite measurement 70-71 
surveying 77 
Switzerland 104-105 
ground roll 181 ,182 
Griineisen parameter 223 
Gubbio, Italy 
extra-terrestrial irridium 9 
polarity record 355-356 
Gutenberg seismic discontinuity 186,197 
guyots 33 

gyromagnetic ratio 322 
half-life 212,213 

half-space, semi-infinite, cooling 243-244, 
366-367 

harmonics, spherical 65B, 67,138, 306, 309, 310B 
Hawaiian Ridge, volcanic island chains 32-33, 
33,34 

head wave 182 
heat 220-252 
conduction 230,232B 
definition 221 
flow 222,225,231-237 
continental crust 233,235-237, 242 
global 241-243 ,242 
ocean ridges 26-27,231-233,239-240 
oceanic crust 237-243, 242 
provinces 229 

subduction zones 28,246-247 
flux 222,225 

geothermal sources 227-230, 230 
radiogenic 228-229 
specific 222 
transport 224—227 
hematite 294,295, 301, 302 , 303, 304 
Himalayas, earthquake focal mechanisms 157-158 
Hipparchus (190-120 BC) 
precession of equinoxes 58 
trigonometry 1 

Hooke’s law 76,122, 123, 126,129B 
hotspots 32-34, 33, 37-38, 39, 197,251-252 
hotspot force 39 

Huygens’principle 171 173,190 
hydrothermal vents 240 

Iapetus Ocean 19 
ilmenite 295 

incidence, angle of 172,173,174,175 
inclination, magnetic 282, 306 
inertia 12,48 
insolation 59 
insulator 253 

interference, synthetic aperture radar 72 
interferometry 
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Michelson, in gravity measurement 74, 75-76 
very long baseline 73,221 
International Geomagnetic Reference Field 
(IGRF) 309, 312, 320, 324 
International Reference Ellipsoid 61, 65 
Io 9 

ionosphere 307-309 ,308 
iron oxide, ternary diagram 294 
island arcs 27,247 
isochron 

rubidium-strontium 215 
uranium-lead 219 
isostasy 99-105 

Airy-Heiskanen model 101-102,104 
compensation 103-104 
discovery 99-101 
gravity anomalies 103-105 
Pratt-Hayford model 101, 102 
thermal 245B 

Vening Meinesz elastic plate model 102,104 
isotopes 
radioactive 

age determination 212-220 
source of Earth’s heat 221,228-229 
isotropy 127-128 

Joule, James Prescott (1818-89), heat energy 
221-222 

Juan de Fuca plate 31 
Juno 6 

Jupiter 4,4, 5,6,9-10,14 
magnetic field 318, 320 
satellites 3,9-10 

Kelvin, William Thomson 1st Baron 

(1824—1907), cooling of the Earth 210 
Kelvin temperature scale 221 
Kennelly-Heaviside layer 307 
Kepler, Johannes (1571-1630), laws of planetary 
motion 2,4, 50 

Konigsberger ratio 321, 327, 333 
Kuiper Belt 13B 
Kula plate 31 

Lame constants 127,129B 
landslides 158 
Laplace equation 63,65B 
Larmor precession 289, 323, 324 
laser-ranging, satellite 68-69 
latitude 64 

correction, gravity measurement 78 
Laurasia 16 

paleomagnetic reconstruction 19, 346, 349 ,349 
Laurentia, paleomagnetic reconstruction 19 
Laurussia, paleomagnetic reconstruction 19 
lead 

lead-lead isochrons 219 
uranium-lead isotopic dating 217 
Legendre polynomial 63, 64B, 65B, 67 
Lehmann discontinuity 195 
libration, lunar 8 
lightning 255-256 

lineation, magnetic 23, 24, 333, 352, 353, 354 
lithosphere 20, 20, 25-26, 26, 194,195 
deformation 106,110-111 
flexure 

oceanic islands 111-112 
subductionzone 111, 112-113 


oceanic 

cooling 231-233,243-244 
structure 246 
plates 21 

driving forces 38-40 
torque 40 

rigidity 106, 110-111 
thickness 113-114 
lodestones 281,282 

loess-paleosol sequences, magnetic susceptibility 
304 

Lorentz, Heinrich (1853-1928), magnetic 
induction 283 
Lorentz field 315 
Lorentz force 214 
Lorentz’s law 285, 286 ,289 
Love wave 136, 137, 139, 140, 147-148 
low-velocity layer 194 , 195-196 

MacCullagh’s formula 62-63 
maghemite 295,302, 304 
magma 
andesitic 27 
basaltic 26 

magnetic anistotropy 291-293 
magnetic anomaly 23-24,287, 320, 321, 327 
horizontal crustal block 330-332 
horizontal cylinder 330 
oblique dike 329-330 
ocean crust 332-334, 352-359 
Pacific Ocean 30-31, 332 
and sea-floor spreading rate 24-25, 31 
South Atlantic 312 
sphere 330 
surveying 324-325 
vertical dike 327-329 
Magnetic Field Satellite (MAGSAT) 309 
magnetic fields 283-284,285-288 

Earth see geomagnetism, Earth’s magnetic 
field 

in materials 287-288 
Moon 317-318 
planets 316, 318-320,318 
Sun 316-317 
magnetic hysteresis 290 
magnetic moment 285,287,288-290, 322, 323 
magnetic permeability 288 
magnetic pole 283-285 
dipole 284, 284 ,285 
see also geomagnetic pole 
magnetic properties 
in materials 288-293 
in rock 293-294 
magnetic relaxation 297B 
magnetic storms 309 
magnetic surveying 324-334 
airborne 325-326, 325,326 
marine 325 ,325 
magnetic susceptibility 288 
in paleoclimatology 304 
magnetism 
early studies 281-283 
environmental 303-305 
as paleoclimate indicator 304 
physics 283-293 
in rock 293-305 
as tracer 303-305 

see also geomagnetism; paleomagnetism 


magnetite 294, 302 , 303, 304 
biogenic 304—305 
magnetization 287-288, 288,289 
Earth’s crust 320-321 
induced 320-321 
remanent 299-303, 321, 334, 352 
anhysteretic 303 
chemical 301-302 
isothermal 290, 302 
measurement 338 
piezoremanent 303 
post-depositional 300-301, 336 
progressive demagnetization 338-339 
sedimentary 300-301 
shock 303 

thermoremanent 299, 335, 353 
viscous 303 
secondary 18 
spontaneous 290 
see also magnetic anomalies 
magnetocrystalline anisotropy 291-292 
magnetometers 321-324 
absorption-cell 323-324 
astatic 334 
cryogenic 338 
flux-gate 322, 323 
gradiometer 325 
proton-precession 322-323 
spinner 338 
magnetopause 307 
magnetosheath 307 
magnetosphere 306-307 ,307 
magnetostatic anisotropy 292-293 
magnetostratigraphy 351-352, 355-356 
magnetostriction 293, 303 
magnetotactic bacteria 304 
magnetotail 307 

magnetotelluric sounding 272-273 
mantle 20, 20, 186,187 ,188 
convection 199-200,247-252, 249 , 251 
models 250-251 
drag force 38 ,39 
lower 

structure 194 , 196-197 
velocity anomalies 201 
viscosity 116-117 
tomography 199-200 ,201 
true polar wander 37 ,38 
upper 

structure 194 , 195-196 
viscosity 115-116 
viscosity 114-117 
mantle flow 38 

mantle plumes 34, 197,251-252 ,251 
super-plumes 199,200 
Mars 4 , 5,6, 8-9, 14 
magnetic field 318 , 319-320 
mass spectrometry 213-214 
Maxwell, James Clerk (1831-79) 
electromagnetism 268,283 
thermodynamics 223 
melting point 223-224 
membrane polarization 266 
Mendocino fault 31, 32 
Menow meteorite 217 
Mercury 4 , 5,6, 7, 14 
magnetic field 318-319,318 
meridian arc, measurement 44-45 
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meteorites, age 219 

Michelson interferometer 74, 75-76, 75 
Mid-Atlantic Ridge 

earthquake focal mechanisms 155, 156 ,156 
gravity anomalies 97-99 
magnetic anomalies 23,24 
Milankovitch cycle 59-60, 355 
mineralization potential 257 
Mintrop wave 182 
modelling, gravity anomalies 89-94 
Mohorovicic discontinuity (Moho) 20,186-187, 
195 

Moon 4 , 6 
age 219-220 
libration 8 

magnetic field 317-318,318 
origin 210, 317 
rotation 8 

tidal periodicity 50-52 

see also Earth-Moon barycenter; Earth-Moon 
distance 

motion, planetary see planets, motion 
mountain chains, gravity anomaly 96-97 
multidomain particles 298-299 
multipole, geomagnetic potential 309, 310B 

Nabarro-Herring creep 110 
Navier-Stokes equation 248 
neap tides 53-54 
nebular hypothesis 12, 15 
Neel temperature 291,295 
Neptune 4, 4 , 5, 6, 11, 14 
magnetic field 318, 320 
trans-Neptunian objects 12,13B 
Nernst potential 257 

Nettleton’s method, density determination 82, 83 
neutron capture 216 
Newton, Sir Isaac (1642-1727) 

Philosophiae Naturalis Principia Mathematica 
45 

Universal Gravitation 3-4,44,45^46, 50 
nodal line 68 

noise, seismic 175,181 ,182 
nuclear explosions, monitoring 170-171 
Nusselt number 249 
nutation 
Euler 57 
forced 58-59 

occultation 4, 7 
Venus 7 

ocean basins, age 25 
oceanic crust 
cooling 231-233,243-244 
gravity anomaly 95-96 
heat flow 237-243, 242 

magnetic anomalies 25, 332-334, 352-359 ,357 
structure 194, 194 
oceans, age-depth variation 245B 
0RSTED satellite 309 

0rsted, Hans Christian (1777-1851), work in 
electricity and magnetism 282 
Ohm’s law 254,260 
Oort Cloud 13B 

optical pumping, magnetometer 323-324 
orbit 

elliptical 59,62 
parameters 3B 


planetary 6 
orebodies 

resistivity surveying 260 
self-potential 257-258 
oscillation 
free 137-140 ,141 
radial 138 ,139 
spheroidal 138,139 ,139 
toroidal 138-139 ,139 
oxygen isotope ratio 304, 305B 

P-wave 130-131, 131, 147-148,186-187 
equation 364 
propagation 171-172,174 
radiation patterns 152-153 
reflection 171-172 
refraction 172,174 
travel-time 149,188-191 
Pacific Ocean, triple junctions 30-32 
Pacific plate 31 ,33 
paleoclimatology 16-17 
magnetic susceptibility 304 
paleomagnetic pole 336-337 
paleomagnetism 18-20, 23, 334-349 
and apparent polar wander 345-348 
and continental drift 348-349 
direction analysis 340-341 
field tests 341-343 
measurement 337-338 
and plate tectonics 343-349 
progressive demagnetization 338-339 
reconstruction 18-19, 346-348 
and true polar wander 37-38 
Pallas 6, 9 
Pangaea 

paleomagnetic reconstruction 19, 348-349, 
348,349 

Wegener’s reconstruction 16-17 
Panthalassa 16 
parallax 1-2 ,1 
paramagnetism 289-290 
Curie temperature 291 
see also superparamagnetism 
pendulum, gravity measurement 74 
Peregrinus, Petrus (13th century), early studies in 
magnetism 281 

perigee 2 
perihelion 2, 59 
permeability 236 
magnetic 288 
permeability constant 284 
permittivity, relative 255,269 
permittivity constant 253 
Phobos 8 
phonons 225 

Picard, Jean (1620-82), meridian degree 
measurement 43-44,45 
piezomagnetism 293 
Planck’s constant 227 
planetesimals 15 
planets 

angular momentum 12,14,15 

Bode’s law 5-7 

density 4 , 5 

discovery 12 

distance from Sun 5-7 

gas giants 9-11 

magnetic fields 316, 318-320,318 


mass 4 
motion 

Kepler’s laws 2,4, 50 
prograde 4,12 
Ptolemy 1 

retrograde 4, 5, 7,11 
orbits 2, 3B, 6 
physical properties 4-5,4 
rotation 4-5,4 
sidereal period 2 
size 4,5,6 
terrestrial 7-9 
plasma 306 
plate tectonics 21 
absolute plate motions 37-38 
driving force 38-40 
Euler poles of rotation 35-38 
hotspot island chains 33-34, 37-38 
margins 21-22, 22, 25-29 
conservative 28-29 
constructive 26-27 
destructive 27-28 

earthquake focal mechanisms 155-157 
seismicity 21, 150-151 
triple junctions 29-32 
northeast Pacific Ocean 31-32 
and paleomagnetism 343-349 
reconstruction 356-358 ,357 
plutinos 13B 
Pluto 4 , 6 , 11-12,13B, 14 
retrograde motion 11 
rotation 4, 5 
Poisson solid 129B 

Poisson’s ratio 111, 124,126-127,129B, 131 
Poisson’s relation 331 

Polar Orbiting Geophysical Observatory 
(POGO) 309 
polar wander 

apparent 18, 37-38, 37,39, 345-348, 349 
true 37-38, 38,39 
polarity 

change 23-24, 349-359 
intervals 350-351 
reversal frequency 358 
transition 350 

polarization, induced 265-267 
pole, magnetic see geomagnetic pole; magnetic 
pole 

pollution, magnetic tracers 303-305 
porosity 236 
potassium 
heat production 228 

potassium-argon isotopic dating 215-216, 351 
potential 
centrifugal 49 

gravitational 46-47,48, 62-63 
magnetic 284 
natural 257-259 
potential difference 254 
power spectral analysis 309 
Prandtl number 248 
Pratt-Heyford model 101, 102 
precession 58-59 
satellites 68 

see also magnetometers, proton-precession 
pressure, hydrostatic, radial variation 193 
primary wave see P-wave 
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Ptolemy (c.90-168 AD), planetary motion 1 
pyrite 295 
pyrrhotite 295,302 

Pythagoras (582-507 BC), spherical Earth 43 

quantum theory 227,287,288 

quasars, very long baseline interferometry 73 

radar 5B 

crustal deformation measurement 71-73 
ground-penetrating 273-274 
synthetic aperture (SAR) 71-72 
radar-ranging 4 
Venus 7 
radiation 

black-body 227 
electromagnetic 269 
see also cosmic rays 
radioactivity 211 
decay 211-212,213 
heat production 228-230, 228 
radiometric age determination 212-220 
Rayleigh number 226,248-249 
Rayleigh wave 136-137,139,147-148 
ground roll 181 ,182 
rebound see elastic rebound model 
red-shift 5B 

redox potential 257-258 
reflection 

angle of 172,173 
critical 175 
P-waves 171-172,173 
subcritical 175 
supercritical 175 
reflection seismology 175-182 
at horizontal interface 176-178 
at inclined interface 178-180 
synthetic seismograms 181 
transmission coefficients 180-181 
refraction 

angle of 174,175 
critical 175 
P-waves 172,174 
refraction seismology 

at horizontal interface 182-184 
at inclined interface 184-185 
change of velocity with depth 185-186 
regolith 317 

relaxation, magnetic 297B 
remagnetization 18 

remanence see magnetization, remanent 
resistance 254 
resistivity 254,256 
apparent 263-264 
resistivity surveying 260-267 
tomography 267 

reversal, polarity see polarity, change 
Reynolds number 249 
rheology 105-117 

Richter earthquake magnitude scale 162-163 
ridges, oceanic 22,23,26 
cooling 231-233 
gravity anomalies 97-99 
magnetic anomalies 333-334, 352-359 
ridge push force 38 ,39 
see also triple junctions 
rigidity 

flexural 111, 112 


lithospheric 106, 110-111 
rigidity modulus 126 
rise and fall gravity measurement 75-76 
Rivera plate 31 
Roche limit 52, 53B, 210 
rock magnetism 293-305 
Rodrigues formula 64B 
root-zone, low-density 101-104 
rotation see Earth, rotation; planets, rotation 
rubidium-strontium isotopic dating 214-215 
rutile 294 

S-wave 131-133,147-148 
equation 364—365 
radiation patterns 152, 153 
travel-time 149,188-191 
salinity, oceanic 210-211 
San Andreas fault 31 ,32 
earthquakes 148 
satellites 
geodesy 67-73 
altimetry 69 

crustal deformation 71-73 
geoid and gravity measurement 70-71 
global positioning systems 69-70 
laser-ranging 68-69 
geomagnetometry 309 
Saturn 4 , 5,6,10, 14 
magnetic field 318, 320 
rings 10 

scattered disk objects 13B 
Schlumberger configuration 261, 262 
Schmidt polynomials 309 
sea surface measurement 69 
sea-floor spreading 22-23 ,357 
rates 24-25, 353-354 
seamounts 33 
secondary wave see S-wave 
sectorial harmonics 310B 
sediments 

as indicator of Earth’s age 211 
pelagic, paleomagnetism 336 
Sedna 13B 

seismic noise 175,181 ,182 
seismic tomography 197-201 
seismic waves 130-140 ,130 
attenuation 135-136 
body waves 130-136,174 
compressional 130-131 
energy 134-135 
equation 133, 363-365 
intensity 134-135 
partition at interface 174,187-188 
propagation 171-186 
surface waves 136-137 
transverse 131-133 
travel-times 149,188-191 
residuals 198,199B, 200B 
velocity 

anomalies 198-199,200B, 201 
in density determination 81 
see also Love wave; P-wave; Rayleigh wave; 

S-wave 

seismicity 21, 150-151 
at subduction zones 27 
seismograms 140,146-148 
phases 147-148 
synthetic 181 


seismographs, history 140-141,186 
seismology 
earthquakes 148-171 
history 121-122, 186-187 
reflection 175-182 
refraction 182-186 
seismometers 140, 141 
broadband 145-146 
damping 143-144 
equation 143,144B 
horizontal-motion 142 
long-period 144 
pendulum 141-142 
short-period 145 
strain 142 143 
vertical-motion 141-142 
seismoscope 140 
self-potential 257 
surveying 258-259 
semiconduction 254-255 
separation of variables 230,231B 
shadow zone 186,187,189-190 
shape anisotropy 292 ,293 
shear 105,124-126 
shear modulus 126-127 
shear stress 105-107,154 
shear wave see S-wave 
sidereal moment 2 
sidereal time 207-208 
single domain particles 298 
slab pull force 39 
slab resistance force 39 
Snell’s law 172,175,182 
solar energy 230-231 
solar flares 309, 317 
solar system 1-15 
angular momentum 12, 14 
orbital parameters 3B 
origin 12, 15 
solar time 207-208 
solar wind 306-307, 308, 312, 317 
solid angles 328 B 
solstice 3B 
spinel 294 

phase change 222,250 
spreading centers 22 
ridge push force 38 
spring tides 53-54 
Stefan-Boltzmann constant 227 
strain 105,122 ,123 
longitudinal 123-124,126 
matrix 123-126 
shear 124-126 
strain-hardening 107,109 
stress 105,122 ,123 
compressional 154 
hydrostatic 154 
normal 123,127 ,128 
shear 105-107,123,154 
tensional 154 
stress matrix 123 

strontium, rubidium-strontium isotopic dating 
214-215 

subduction zones 22, 26,151 
driving forces 39 

earthquake focal mechanisms 156-157 
gravity anomalies 99 
heat flow 28,246-247 
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lithospheric flexure 111, 112-113 
seismicity 27,151 
Sun 14 

cooling 209-210 
magnetic fields 316-317 
tidal effect 52-53 
sunspots 309, 317 
super-plumes 199, 200 
see also mantle plumes 
superadiabatic gradient 226, 248 
superchrons 358 ,359 
superparamagnetism 297 
surveying 
electrical 256 
resistivity 260-267 
self-potential 258-259 
electromagnetic 268-274 
induction 271-272 
magnetic 324-334 
susceptibility, magnetic 288 
suture zone 26 

telluric currents 257,259-260 
temperature 

adiabatic gradient 222,223,226 
definition 221 
inside Earth 222-224 
melting-point gradient 223-224 
terrain correction, gravity measurement 77, 
79 

tesseral harmonics 310B 
Tethys 16, 348,349 

Thales of Miletus (c.620-c.555 BC) 1,252 

thermodynamics 221-224 

thorium, heat production 228 

thunderstorms 255-256 

tidal friction, lunar 55-57, 73 

tidal lag 55 

tides 50-54 

bodily Earth-tides 50, 54 

effect on gravity measurements 54 

lunar periodicity 50-52 

solar 52-53, 73 

spring and neap 53-54 ,54 


time 

geological scale 208 ,209 
units 207-208 
Titan 10 

titanium oxide, ternary diagram 294 
titanohematite, series 294 ,295 
titanomagnetite, series 294-295, 295,302, 
353 

Titius-Bode’s law 5-7 
tomography 
electrical resistivity 267 
seismic 197-201 
torque 

lithospheric plates 40 
magnetic field 285,286-287 
tidal 55, 58 

trans-Neptunian objects 12, 13B 
transverse wave see S-wave 
travel-times 188-191 
residuals 198,199B 
trenches, oceanic 22,27 
trench suction 39 

see also subduction zones; triple junctions 
trigonometry 

Earth-Sun distance 1-2 ,1 
spherical 35B 
triple junctions 29-32 
evolution, northeast Pacific 30-32 
stability 29-30 
Triton 11 

tsunami 158-160,159B 
twotinos 13B 

ulvospinel 294 
universal time 208 
universe 
Aristotelian 1,2 
Copernican 2 
uplift 

isostatic 103 

and mantle viscosity 114-117 ,114 
uranium 

heat production 228 
uranium-lead, isotopic dating 217-219 


Uranus 4 , 6 , 10-11, 14 
magnetic field 318, 320 
rotation 4, 5 

Van Allen radiation belts 307 ,308 
velocity anomalies 198-199,200B, 201 
Vening Meinesz elastic plate model 102,104 
Venus 4 , 6 , 7, 14 
magnetic field 318 , 319 
retrograde motion 4, 7 
rotation 5 

vertical electrical sounding 262, 264-265 
Vesta 6,9 
vibration 137-138 

Vine-Matthews-Morley hypothesis 23-24,26, 352 
viscosity 106-107 
dynamic 107 
kinematic 226 
mantle 114-117 
volcanism 
hotspot 32-34 
intraplate 32 
island arc 27 
volt 254 

Volta, Alessandro (1745-1827), voltaic cell 252, 
282 

Wadati diagram 149 

Wadati-Benioff seismic zone 27,151 ,151 
wave equation 133, 363-365 
waves see seismic waves 

Wegener, Alfred Lothar (1880-1930), continental 
drift 16, 334, 348 
Weiss constant 290 
Weiss domains 298 
Wenner configuration 261,262 
work 46 
wustite 294 

yield stress see elastic limit 

Young’s modulus 107, 111, 126,127,129B, 131 

Zeeman effect 316, 323 
zircon 220 



